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(57) Abstract: A method for the identification of an anti-mycobacterial agent that modulates the activity and/or expression of a 
protein expressed by a Mycobacterium in non-oxygen limiting stationary phase, hypoxic stationary phase or hypoxic growth phase, 
which method comprises: (i) contacting a test agent and a protein selected from RV3133c, Rv2623, Rv2626c, a variant of Rv3 133c, 
Rv2623 orRv2626 and a fragment of Rv3133c. RV2623, Rv2626c or said variant, or a poljniucleotide or expression vector encoding 
said protein; (ii) monitoring the effect of the test agent on the activity and/or expression of said protein, thereby determining whether 
the test agent modulates the activity and/or expression of a protein expressed by a Mycobacterium in non-oxygen limiting stationary 
phase, hypoxic stationary phase or hypoxic growth phase. 
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DORMANCY INDUCED MYCOBACTERIUM VROTEmS 

Field of the Invention 

The present mvention relates to proteins expressed in donnant 
5 Mycobacterium and tP the use of such polypeptides in the prophylaxis, diagnosis and 
treatment of mycobacterial infectioiis. 

Background to the Invention 

Upon infection with Mycobacterium tuberculosis most individuals moimt an 
10 effective immime response causing the infection to enter a latent state. The tubercle 
bacillus may remain in the host for years without causing the symptoms of 
tuberculosis, with possible reactivation later in life. Little is known about the state in 
which the bacilli survive in the host during latent infection. Some evidence suggests 
that cells survive in a state that is similar to the state of bacilli in non-oxygen limiting 
15 or hypoxic stationary phase ciilture. 

Mycobacteria are obligate aerobes, i.e. they require oxygen for growth. 
HoAvever, tubercle bacilli encounter hypoxic enviroimients in acute disease as well as 
in latent infection. Recent genetic evidence suggests that the capability of tubercle 
bacilli to adapt to hypoxic conditions plays a role in vivo. A link has been 
20 estabUshed between oxygen starvation and drug resistance. Upon depletion of 

oxygen in culture the bacillus enters a hypoxic growth phase then terminates growth 
and develops into a dormant form. Importantly, the donnant form of the bacterium is 
resistant against conventional anti-mycobacterials. Hence, hypoxic dormant bacteria 
could, at least in part, be responsible for the observed persistence of the pathogen 
25 during chemotherapy. 

The Wayne dormancy culture system (Wayne and Hayes (1996) Infect, 
Inunun. 64: 2062-2069) is based on growth of the bacilli under oxygen-limited 
conditions in sealed stirred tubes. Initially the cultures grow exponentially and 
consume oxygen rapidly. A temporal oxygen gradient is generated and the cultures 
30 terminate growth when the oxygen concentration reaches a hypoxic threshold level. 
Bacilli in the hypoxic stationary phase are in a state of synchronised non-replicating 
persistence or dormancy. 
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Our knowledge of the molecules involved in the mycobacterial doraiancy 
response is firagmentaiy. Elegant biochemical work has shown that dormant bacilli 
adapt their metabolism to anaerobiosis by switching to nitrate respiration and 
reductive anoination of glyoxylate. Genetic analysis has demonstrated that the 
5 stringent response plays a crucial role in tiie adaptation to hypoxic conditions and 
stationary phase survival. 

Summary of the Invention 

The present inventors have identified four proteins that are up-reg\ilated upon 
10 termination of growth of the attenuated BCG Pasteur ATCC 3 5734 strain of 

Mycobacterium bovis in the Wayne dormancy culture system. The inventors have 
also shown that two of these proteins are upregulated ia Mycobacterium bovis 
, maintained in a non-oxygen limiting stationary phase. ITiese proteins play a role in 
the development of the dormant state and in the maintenance of viability during 
15 dormancy. The proteins are novel screening targets for the identification and 
development of novel pharmaceutical agents. These agents may be used in the 
treatment, prophylaxis and/or diagnosis of mycobacterial infections such as 
tuberculosis. 

Accordingly, the present invention provides: 
20 - a method for the identification of an anti-mycobacterial agent that modulates 
the activity and /or ejqjression of a protein, which method comprises: 

(i) contacting a test agent and a protein selected firom RV3 133c, Rv2623, 
Rv2626c, a variant of Rv3 133c, Rv2623 or Rv2626c and a fragment 
of Rv3 1 33c, RV2623, Rv2626c or said variant, or a polynucleotide or 

25 expression vector encoding said protein; 

(ii) monitoring the effect of the test agent on the activity and/or 
expression of said protein, thereby detemuning whether the test agent 
modulates the activity and/or e35)ression.of a protein expressed by a 
Mycobacterium in non-oxygen limiting stationary phase, hypoxic 

30 stationary phase or hypoxic growth phase. 

an agent which is identifiable by a method of the invention; 

an antibody specific for a protein selected firom Rv3 133c, Rv2623 or 



wo 02/48391 PCT/EPOl/14551 



Rv2626c; 

- a phannaceutical composition comprising a phaimaceuticaUy effective earner ^ 
and as an active ingredient an effective amoTont of an agent or an antibody 
according to the invention; 
5 - a vaccine composition comprising as an active ingredient an effective amount 
of a protein selected from Rv3 133c, Rv2623, Rv2626c and a variant of any 
thereof, or an immunogenic fragment any said protein, and a 
pharmaceutically effective carrier; 

an agent, antibody, pharmaceutical composition or vaccine composition 
10 according to the invention for use in a method oftreatmentofthe human or 

animal body by therapy or in a diagnostic method practised on the human or 
animal body; 

use of an agent, antibody, pharmaceutical composition or vaccine 
• composition according to the invention in. the manufacture of a medicament 

15 for the diagnosis, prophylaxis or treatment of a mycobacterial infection; 

a method of treating a subject suffering from a mycobacterial infection, which 
method comprises administering to said subject a therapeutically effective 
amovmt of an agent, antibody, pharmaceutical composition or vaccine . 
composition according to the invention; 

20 - a method for detecting a mycobacterial infection in a sample, which method 
comprises detecting tiie presence of a protein expressed by a Mycobacterium 
in non-oxygen limiting stationary phase, hypoxic stationary phase or hypoxic 
growth phase or a nucleic acid encoding said protein in said sample, wherein 
said protein is selected from Rv3 133c, Rv2623, RV2626c and a variant of 

25 any thereof ; and 

an in vitro or in vivo method for diagnosing a mycobacterial infection in a 
subject which method comprises monitoring expression of a protein 
expressed by a Mycobacterivm in non-oxygen limiting stationary phase, 
hypoxic stationary phase or hypoxic growth phase, wherein said protein is 

30 selected from Rv3133c, Rv2623, RV2626c and a variant of any thereof. 

use of a protein selected from Rv3 133c, Rv2623, Rv2626c, a variant of 
Rv3133c, Rv2623 or Rv2626c and a fragment of Rv3 133c, Rv2623, Rv2626c 



wo 02/48391 



PCT/EPOl/14551 



or said variant in a method for the identification of an anti-mycobactmal 
agent 

Tjse of a protein selected from Rv3 133c, Rv2623, Rv2626c, a variant of 
Rv3 133c, Rv2623 or Rv2626c and a fragment of Rv3133c, Rv2623, Rv2626c 
5 or said variant in a method for the identification of an agent for diagnosing a 

dormant mycobacterial infection. 
Preferably the mycobacterial infection is tuberculosis. 

Brief Description of the Figures 

10 Fig. 1 . shows the growth of BCG under the conditions of the Wayne 

dormancy culture system. Log A^oo as a function of time is shown. Viable counts at 
selected time potats are indicated. An aerobic exponential preculture was diluted to 
A500 = 0.005 with Dubos Tween-albumin broth (Difco) and incubated in sealed tubes 
under gentie stirring conditions, f and d indicate fading and complete decolorization 

15 of the oxygen indicator methylene blue. Mean values and standard deviations from 
four independent experiments are shown. Viable counts were determined by plating 
appropriate dilutions of the cultures on Dubos oleic albumin agar. For all cfti 
determinations, cultures were checked microscopically for any clumping of cells. 
Significant clumping was not observed. Arrows A to D indicate time points when 

20 samples were taken for the two-dimensional electrophoretic analysis of protein 
contents shown in Fig. 2. 

Fig. 2. shows the temporal profile of protein contents of BCG grown in the 
Wayne dormancy culture system. Protein extracts were prepared at the time points A 
to D indicated by the arrows in Fig. 1 and 100 |ig of total protein was subjected to 

25 two dimensional electrophoresis. A-D show silver-stained gels corresponding to the 
4 time points. Arrows labeled 1 to 4 indicate dormancy-induced protein spots. The 
arrowhead shows the 22 kD allcyl hydroperoxide reductase (Ahpc, Rv2428) which 
has been foxmd to be down-regulated in the hypoxic stationary phase (Sherman et al. 
(1999) Biofactors 1 0:21 1-217). Sizes are indicated in kilodaltons. The experiment 

30 was carried out four times with the same results. Growth phase-dependent protein 
contents were also analysed using isoelectric focussing strips pH 3-10. No additional 
dormancy-induced proteins were detected. 
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Fig. 3. shows donnancy-induced proteins. Protein nxmibers 1 to 4 correspond 
to protein spots 1 to 4 in Fig. 2. The protein names are according to the M. 
tuberculosis H37Rv genome amiotation. To predict lie domain architecture of the 
dormancy-induced proteins their sequence (derived from the M tuberculosis miBy 
5 genome date base TubercuList (Institut Pasteur, Paris, France, 

[http://genoIist.pasteur.fi-/TubercuList/], Data Release R2, March 22, 1999)) was • 
searched against the protein domain famihes date base Pfem (Sanger Centre, 
UK[http://www.s,anger.ac.uk/Software/Pfam/], version 5.5 September 2000). The 
result ofthe search is shown schematicaUy. HSP20, domain found in 20 kD femily 
10 of heat shock proteins (Accession number -PFOOO 11). res_reg, receiver domamfomid 
m response regulators (Accession ntomber PF00072). GerE, helix-tum-helix DNA 
binding domain (luxR subfamily) found in transcriptional regulators (Accession 

. °"^berPF00582). CBS, smaU modules ofunknown function found in diverse 
■ proteins. Paire of CBS domains dimense to form a stable globular domain 

. 15 (Accession number PF00571). 

Fig. 4. shows steady state levels of mRNAs encoding dormancy-induced 
proteins in exponentially growing and dormant cultures. Autoradiograms of 
Northern blbts of total RNA from exponentially growmg Gane A, from time pomt A 
in Fig. 1) and hypoxic stationary phase cultures (lane D. from tine point D m Fig. 1) 

20 are shown.- The blots were hybridised with [a^^PJdATP labeled probes specific to 
the transcripts encoding the doimancy-mduced proteins: 1, 16kD antigen; 2, 23, kD 
response regulator; 3, 32 kD conserved hypothetical protein; 4, 16 kD conserved 
hypothetical protem (Fig. 3). X-ray fihns were exposed for 1 day. The bottom panel 
shows the blots after re-hybridisation with a probe specific to 16s rKNA 

25 demonstrating equal loading of ribosomal RNA. 5 ng of total RNA was loaded. The 
experiment was repeated once with the same results. The up-regulation ofthe 
transcripts was confirmed independently by reverse transcriptase - PGR analysis. 
Sizes are indicated in bases. 

Fig. 5 shows the growth of BCG in roUer bottles. Ejq)onentially growing pre- 
30 cultures were diluted and grown m roUer bottles. Optical density as a fimction of 
time is shown. Arrows A to D indicate time points when samples were taken for the 
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two-dimensional gel electrophoretic analysis of protein contents shovsdn in Fig. 6. 
The experiment was carried out four times with the same results and a representative 
growth curve is shown. 

Fig. 6 shows the temporal profile of protein contents of BCG grown in roller 

5 bottles. Protein extracts were prepared at time points A to D indicated by the arrows 
in Fig. 5 and subjected to two dimensional gel electrophoresis. A-D show silver- 
stained gels corresponding to the 4 time points. Arrows labeled 1 and 2 iadicate 
stationary phase-induced protein spots. The arrowhead shows the 22 kD alkyl 
hydroperoxide reductase (AhpC, Ry2428) which was foxmd to be down-regulated in 

10 the stationary phase. The experiment was carried out four times and a representative 
set of gels is shown. 

Fig. 7 shows a multiple sequence alignment of the 14 kD stationary phase and 
hypoxic stationary phase induced tubercle bacillus protein and the most similar 
orthologs found in other bacteria. The sequence of the 143 amino acid mycobacterial 

15 protein Rv2626c as derived from the M. tuberculosis H37Rv genome data base 

TubercuList (Institut Pastern:, Paris, France, |ttlp://genolist.pasteur.fi:/Tuberct3List/]) 
was searched against the non-redundant GenBank data base (National Center for 
Biotechnology Information, Bethesda, USA, [http://www/ncbi.nlm,nih.gov]) to 
identify orthologs in other bacteria. M. tub.: 143 amino acid protein Rv2626c from 

20 Mycobacterium tuberculosis, accession number A70573; S.coe.: 141 amino acid 
protein from Streptomyces coelicolor, accession number CAB62687; P.aer.: ISiS 
amino acid protein from Pseudomonas aeruginosa, accession number AAG05547-, 
B.sub,: 140 amino acid protein from Bacillus subtilis, accession mmiber B69824. To 
predict the domain architecture of the proteins their sequence was searched against 

25 the protein domain families data base Pfam (Sanger Centre, UK 

[http://www.sanger.ac.uk/Software/Pfam/]). The positions of two predicted CBS 
domaios (Accession number PF00571) are indicated above the alignment. 

Brief Description of the Sequences 

30 

SEQ ID NO:l shows tlie nucleotide sequence of Rv3 133c from the H37Rv strain of 
Mycobacterium tuberculosis. 
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SEQ m N0:2 W to amino add of Rv3I33c fan to H37Rv arain of 

Mycobacterium tuberculosis. 

SEQ ID N0:3 shows the nucleotide sequence of Rv2623 from the H37Rv strain of 
Mycobacterium tuberculosis, 

SEQ ID NO:4 shows the amino acid sequence of Rv2623 from the H37Rv strain of 
^cobacterium tuberculosis. 

SEQ ID NO:5 shows the nucleotide sequence of Rv2626c from the H37Rv strain of 
Mycobacterium tuberculosis, 

SEQ ID NO:6 shows the axnino acid sequence of Rv2626c fix,m the H37Rv strain of 
Mycobacterium tuberculosis. 

SEQ ID NO:7 shows the nucleotide sequence of Rv3 132c fix,m the H37Rv strain of 

Mycobacterium tuberculosis, 

SEQ ID NO:8 shows the amino acid sequence of Rv3132c from the H37Rv strain of 

Mycobacterium tuberculosis, . 

SEQ ID NO:9 shows the amino acid sequence of a variant of Rv2626c from 

Streptomyces coelicolor, 

SEQ ID NO: 10 shows the amino acid sequence of a variant of Rv2626c from 

Pseudomonas aeruginosa, 

SEQIDNO:ll shows the ammo acid sequence ofa variant of Rv2626c from 
Bacillus subtilis. 



Detafled Description of the Tt,vA»«»„ 
Proteins 

The present invention relates to donnancy-induced Mycobacterium proteins 
RV3133C is shown in SEQ ID NO: 1 and SEQ ID NO: 2, Rv2623 is shown in SEQ 
ID NO: 3 and SEQ ID NO: 4 and Rv2626c is shown m SEQ ID NO: 5 and SEQ ID 
NO: 6. Sequence mfonnation for the donnancy-induced mycobacterial proteins of 
the mvention is t^en from the complete genome sequence of the H37Rv strain of 
Mycobacterium tuberculosis (Cole et al, (1998) Nature 393: 537-544) and can be 
found at http://genohst.pasteur.fr/TubercuList/ . 

Proteins of the invention may be in a substantially isolated form. It will be 
understood that the protem may be mixed with carriers or diluents which wiU not 
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interfere with the intended purpose of tiie polypeptide and still be regarded as 
substantially isolated. In particular an isolated protein of the invention will be 
separated from other Mycobacteriton proteins. For example, an isolated polypeptide 
may be obtained by separating the polypqjtide from other Mycobacterial proteins on 
" 5 a 2D-gel and extmcting the polypeptide from the gel. A protein of the invention may 
also be in a substantially purified form, in which case it will generally comprise the 
polypeptide in a preparation in which more than 50%, e.g. more than 80%, 90%, 95% 
or 99%, by weight of the protein in the preparation is a protein of the invention. 
Routine methods, can be employed to purify and/or synthesise the proteins according 
10 to the invention. Such methods are well understood by persons skilled in the art, and 
include techniques such as those disclosed in Sambrook et al. Molecular Cloning: a 
Laboratory Manual, 2"^ Edition, CSH Laboratory Press, 1989, the disclosure of 
which is included herein in its entirety by way of reference. 

The term "variant" refers to a homologues of Rv3133c, Rv2623 or Rv2626c 
1 5 derived from other strains of Mycobacteritun such as the attenuated BCG Pasteur 
ATCC 35734 strain, or from other microorganisms such as bacteria including 
StreptomyceSy Pseudomonas and Bacillus, 

The term "variant" refers to a protein which is sufficiently similar to 
Rv3 133c, Rv2623 or Rv2626c that the variaiit is capable of eliciting an inmmne 
20 . response to Rv3 133c, Rv2623 or Rv2626c.- Preferably the variant can be used to 
raise antibodies to Rv3 133c, Rv2623 or Rv2626c. More preferably the variant can 
be used to generate CDS T-cells that respond to Rv3 133c, Rv2623 or Rv2626c. 

Preferably a varaint of Rv3 1 33c is capable of binding DNA. More preferably 
a variant of Rv3 133c is capable of binding a sensor histidine protein kinase such as 
25 Rv3132c. A variant of Rv3133c preferably has transcriptional regulatory activity. 

. Typically, polypeptides with more than about 65% identity preferably at least 
80% or at least 90% and particularly preferably at least 95% at least 97% or at least 
99% identity, with the amino acid sequences of SEQ ID NO: 2, SEQ ID NO: 4 or 
SEQ ID NO: 6 are considered as variants of the proteins. Such variants iaclude allelic 
30 variants and liie deletion, modification or addition of single amino acids or groiq)s of 
amino acids within the protein sequence, as long as the polypeptide maintains the • 
ability to generate an immune response which is effective against Rv3 133c, Ry2623 
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or Rv2626q proteins. " 

Amino acid substitutions may be made, for example from 1, 2 or 3 to 10, 20 
or 30 substitutions. TTie modified protein generaUy retains activity as an 
immunogen. Conservative srAstitutions may be made, for example according to the 
following Table. .Amino acids in the same block in the second column and preferably 
in the same line in the third column may be substituted for each other. 



10 



15 



ALIPHATIC 


Non-polar 


GAP 






ILV 1 




Polar-uncharged 


CSTM 






NQ 




Polar-charged 


DE 






KR 


AROMATIC 




HFWY j 



Shorter polypeptide sequences are within the scope of the invention. For 
example, a peptide of at least 20 amino acids or up to 50, 60, 70, 80, 100, 150, 200, 
300, 400 or 500 amino acids in length is considered to fall within the scope of the 
invention as long as it is capable of eUciting antibodies or a CDS T-cell response to 
Rv3i33c, Rv2623 or Rv2626c. 

Proteins for use in the invention may be chemically modified, e.g. post- 
translationally modified. For example, they may be glycosylated or comprise 
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modified amino acid residues. They may also be modified by the addition of histidine 
residues to assist their purification. 
Polynucleotides 

The invention also includes nucleotide sequences for use in methods of the 
5 invention that encode for Rv3 133c, Rv2323 or Rv2626c or variant or firagment of 
any thereof as well as nucleotide sequences which are complementary thereto. The 
nucleotide sequence may be KNA or DNA. Nucleotide sequence information is 
provided in SEQ ID NO: 1, SEQ ID NO: 3 or SEQ ID NO: 5. Such nucleotides can 
be isolated from cells or synthesised according to methods well known in the art, as 
10 " described by way of example in Sambrook et al, 1989. 

Typically a polynucleotide for use in the invention comprises a contiguous , 
sequence of nucleotides which is capable of hybridizing under selective conditions to 
the coding sequence or the complement of the coding sequence of SEQ ID NO: 1 , 
SEQ ID NO: 3 or SEQ ID NO: 5. 
15 A polynucleotide for use in the invention can hydridize to tiie coding 

sequence or the complement of the coding sequence of SEQ ID NO: 1, SEQ ID NO: 
3 or SEQ ID NO: 5 at a level significantly above background. The signal- level 
generated by the interaction between a polynucleotide of the invention- and the 
coding sequence or complement of the coding sequence of SEQ ID NO: 1, SEQ ID 
20 NO: 3 or SEQ ID NO: 5 is typically at least 10 fold, preferably at least 100 fold, as 
intense as interactions between other polynucleotides and the coding sequence of 
SEQ ID NO: 1 , SEQ ID NO: 3 or SEQ ID NO: 5. The intensity of interaction may 
be measured, for example, by radiolabelling the probe, e.g. with ^^P. Selective 
hybridisation may tj^ically be achieved using conditions of medium to high 
25 stringency. However, such hybridisation may be carried out under any suitable 
conditions known in the art (see Sambrook et al, 1989. For example, if high 
stringency is required suitable conditions include from 0.1 to 0.2 x SSC at 60 up to 
65 °C. If lower stringency is required suitable conditions include 2 x SSC at 60 **C. 
The coding sequence of SEQ ID NO: 1 , SEQ ID NO: 3 or SEQ ID NO: 5 
30 may be modified by nucleotide substitutions, for example from 1, 2 or 3 to 10, 25, 50 
or 100 substitutions. The polynucleotide of SEQ ID NO: 1, SEQ ID NO: 3 or SEQ 
ID NO: 5 may altematively or additionally be modified by one or more insertions 
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and/or deletions and/or by an extension at dther or ho± ends. The modified 
polynucleotide generaUy encodes a polypeptide which has immunogenic activity. 
Alternatively, a polynucleotide encodes a DNA-binding portion of Rv3 133c or a 
firagment of Rv3 133c capable of binding a sensor histidine protein kinase such as 
5 Rv3132c. Degenerate substitutions may be made and/or substitutions may be made 
which would result in a conservative amino acid substitution when the modified 
sequence is translated, for example as shown in the Table above.' 

A nucleotide sequence which is capable of selectively hybridizing to the 
complement oftheDNA coding sequence of SEQ ID NO: 1, SEQ ID NO: 3 or SEQ 
) ID NO: 5 will generally have at least 60%, at least 70%, at least 80%, at least 90%, at 
least 95%, at least 98% or at least 99% sequence identity to the coding sequence o^ 
SEQ ID NO: 1 , SEQ ID NO: 3 or SEQ ID NO: 5 over a region pf at least 20, 
preferably at least 30, for instance at least 40, at least 60, more preferably at least 100 
contiguous nucleotides or most preferably over the full length of SEQ ID NO: 1, 
SEQ ID NO: 3 or SEQ ID NO: 5. 

For example the UWGCG Package provides the BESTFTT program which 
can be used to calculate homology (for example used on its defeult settings) 
(Devereux et al (1984) Nucleic Acids Research 12, p387-395). The PILEUP and 
BLAST algorithms can be used to calculate homology or line up sequences (typically 
on their default settings), for example as described in Altschul (1993) J. Mol. Evol. 
36:290-300; Altschul et al (1990) J. Mol. Biol. 215:403-10. 

• Software for performing BLAST analyses is publicly available through the 
National Centre for Biotechnology Information aittp://www.ncbi nlm n^T, p»,,A 
This algorithm involves first identifying high scoring sequence pair (HSPs) by 
identifying short words of length W in the query sequence that either match or satisfy 
some positive-valued threshold score T when ahgned with a word of the same length 
in a database sequence. T is referred to as the neighbourhood word score threshold 
(Altschul et al, 1990). These initial neighbourhood word hits act as seeds for 
initiating searches to find HSPs contaming them. The word hits are extended in both 
directions along each sequence for as far as the cumulative alignment score can be 
increased. Extensions for the word hits in each direction are halted when: the 
cumulative ahgnment score fells off by the quantify X from its maximum achieved 
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value; the ciamulative score goes to zero or below, due to the accumulatioii of one or 
more negative-scoring residue alignments; or the end of either sequence is reached. 
The BLAST algorithm parameters W, T and X determine the sensitivity and speed of 
the alignment The BLAST program uses as defaults a word length (W) of 1 1 , the 

5 BLOSUM62 scoring matrix (see Henikoff and Henikoff (1992) Proc, Natl Acad 
Set USA 89: 10915-10919) alignments (B) of 50, expectation (E) of 10, M=5, N=4, 
and a comparisori of both strands. 

The BLAST algorithm performs a statistical analysis of the similarity 
between two sequences; see e.g., Karlin and Altschul (1993) Proc, Natl Acad, Sci. 

10 USA 90: 5873-5787. One measure of similarity provided by the BLAST algorithm 
is the smallest sum probability (P(N)), which provides an indication of the 
probability by which a match between two nucleotide or arnino acid sequences would 
occur by chance. For example, a sequence is considered similar to another sequence 
if the smallest sum probability in comparison of the first sequence to the second 

15 sequence is less than about 1, preferably less than about 0.1, more preferably less 
than about 0.01, and most pireferably less than about 0.00 1 . 

Any combination of the above mentioned degrees of sequence identity and 
mmiTmim sizes may be used to define polynucleotides of the invention, with the 
more stringent combinations (i.e. higher sequence identity over longer lengths) being 

20. preferred. Thus, for example a polynucleotide which has at least 90% sequence 

identity over 25, preferably over 30 nucleotides forms one aspect of the invention, as 
does a polynucleotide which has at least 95% sequence identity over 40 nucleotides. 

The polynucleotides have utility in production of the proteins according to the 
invention, which may take place in vitro, in vivo or ex vivo. The nucleotides may be 

25 involved in recombinant protein synthesis or indeed as therapeutic agents in their 
own right, vitilised in nucleic acid vaccines or gene therapy techniques. Nucleotides 
complementary to those encoding Rv3 133c, Rv2323 or Rv2626c, or antisense 
sequences, may also be used in gene therapy. 
Vectors 

30 The present invention also utUizes expression vectors that comprise 

nucleotide sequences encoding the proteins or variants thereof of the invention. Such 
expression vectors are routinely constructed in the art of molecular biology and may 



wo 02/48391 

PCT/EPOl/14551 



for example involve the use of plasmid DNA and appropriate initiators, promoters, 
enhancers and other elements, such as for example polyadenylation signals which 
may be necessary, and ^ch are positioned in the correct orientation, in order to 
aUow for protein expression. Other suitable vectors would be apparent to persons 
skilled in the art By way of further example in this regard we refer to Sambrook et 
al. 1989, 

Preferably, a polynucleotide for use in the invention in a vector is operably 
linked to a control sequence ^.d^ch is capable of providing for the expression of the 
codmg sequence by the host ceU. i.e. the vector is an expression vector. The term 
"operably linked" refers to a juxtaposition wherein the components described are in a 
relationship permitting them to function in their intended manner. A regulatory 
sequence, such as a promoter, "operably hnked" to a coding sequence is positioned in 
such a way that expression of the coding sequence is achieved under conditions 
compatible with the regulatory sequence. 

Cells ■ . . 

The invention also utilises cells that have been modified to express Rv3 133c, 
RV2323 or Rv2626c or a variant thereof Such cells include transient, or preferably ' 
stable higher eukaryotic ceU lines, such as mammaUan cells or insect cells, using for 
example a baculovirus expression system, lower eukaryotic ceUs, such as yeast or 
prokaiyotic cells such as bacterial or mycobacterial ceUs. Particular examples of 
cells which may be modified by insertion of vectors encoding a p olypeptide 
according to the invention include mammaUan HEK293T, CHO, HeLa, BHK, 3T3 
and COS cells. 

Antibodies 

According to another aspect, the present mvention also relates to antibodies 
specific for a protein that is up-regulated imder hypoxic conditions or in stationary 
phase culture and in particular antibodies that are specific for a protein selected firom 
Rv3133c,Rv2623andRv2626c. Such antibodies are for example usefiil in 
purification, isolation or screening methods involving immunoprecipitation 
techniques or, indeed, as therapeutic agents m their own right. 

Antibodies may be raised against specific epitopes of the proteins according 
to the invention. Such antibodies may be used to block Rv3 133c. or a variant 
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thereof, binding to DNA or to a sensor histidine protein Idnase such as Rv3 1 32c. An 
antibody, or other compound, "specifically binds" to a protein when it binds with 
preferential or high affinity to the protein for which it is specific but does 
substantially bind, or binds with only low affinity, to other proteins. A variety of 

5 protocols for competitive binding or immunoradiometric assays to deteraiine the 
specific binding capability of an antibody are well known in the art (see for example 
Maddox et al, J, Exp. Med, 158, 1211-1226, 1993). Such immuuoassays typically 
involve the formation of complexes between the specific protein and its antibody and 
the measurement of complex formation. 

10 Antibodies of the invention 'may be antibodies to human polypeptides or 

fragments thereof For the purposes of this invention, the term "antibody", unless 
specified to the contrary, includes fragments which bind a polypeptide of the 
invention. Such fragments include Fv, F(ab') and F(ab')2 fragments, as weU as single 
chain antibodies. Furthermore, the antibodies and firagment thereof may be chimeric 

15 antibodies, CDR-grafled antibodies or humanised antibodies. 

Antibodies may be used in a method for detecting polypeptides of the 
invention in a biological sample, which method comprises: 
I providing an antibody of the invention; 

n incubating a biological sample with said antibody under conditions which 
20 allow for the formation of an antibody-antigen complex; and 

in determining whether antibody-antigen complex comprising said antibody is 
formed. 

A sample may be for example a tissue extract, blood, serum and saliva. 
Antibodies of the invention may be bound to a solid support and/or packaged into 

25 kits in a suitable container along with suitable reagents, controls, instructions, etc. 
Antibodies may be linked to a revealing label and thus may be suitable for use in 
methods of in vivo or in vitro imaging of Rv3 133c, Rv2623 or Rv2626c, for 
example, in a method of diagnosis. 

Antibodies of the invention can be produced by any suitable method. Means 

30 for preparing and characterising antibodies are well known in the art, see for example 
Harlow and Lane (1988) "Antibodies: A Laboratory Manual", Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY. For example, an antibody fiiay be 
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produced by raising antibody in a host animal against tbe whole polypeptide or a 
fragment thereof, for example an antigenic epitope thereof, herein after the 
"immunogen". 

A method for producing a polyclonal antibody comprises immumsing a 
suitable host animal, for example an e3q)erimental animal, with the immunogen and 
isolating immunoglobulins from the animal's serum. The animal may therefore be 
inoculated with the immunogen, blood subsequently removed from the animal and 
the IgG fiactibn purified. ' • 

A method for producing a monoclonal antibody comprises immortalising 
cells which produce the desired antibody. Hybridoma ceUs may be produced by 
fusing spleen cells from an inoculated experimental animal with tumour cells (Kohler 
and Milstein (1975) Nature 256,495-497): 

An immortalized ceU producing the desired antibody may be selected by a 
conventional procedure. The hybridomas may be grown m culture or mjected 
intraperitoneaUy for formation of ascites fluid or into the blood stream of an 
aUogenic host or immunocompromised host. Human antibody may be prepared by in 
vitro immunisation of human lymphocytes, followed by transformation of the 
lymphocytes with Epstem-Bjair virus. 

For the production of both monoclonal and polyclonal antibodies, the 
experimental animal is suitably a goat, rabbit. rat or mouse. If desired, the 
immunogen may be administered as a conjugate in which the immunogen is coupled, 
for example via a side cham of one of the amino acid residues, to a suitable carrier. 
The carrier molecule is typicaUy a physioIogicaUy acceptable carrier. The antibody 
obtained may be isolated and, if desired, purified. 
25 Monitoring activity 

An important aspect of the present mvention is the use of proteins that are 
upregulated under hypoxic conditions or in stationary phase culture in screening 
methods. The screening methods may be used to identify substances, that bmd to 
hypoxic growth phase, hypoxic stationary phase or non-oxygen limiting stationary ' 
phase induced proteins and in particular vMch bind to Rv3 133c, Rv2623 or 
Rv2626c. Screening methods may also be used to identify agonists or antagonists 
which may modulate Rv3133c activity, inhibitors or activators of Rv3133c 
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transcriptional activity, and/or agents which tip-regulate or down-regulate Rv3133c, 
Rv2623 or Rv2626c e3q)ression. 

Any suitable format may be used for the assay. In general terms such 
screening methods may involve contacting a dormancy-induced or a stationary-phase 

5 induced protein with a test agent and monitoring for binding of the test agent to flie 
protein. The protein may be incubated with a test agent. Modulation of Rv3 133c ■ 
activity may be determined. In a preferred aspect, the assay is a cell-based assay. 
Preferably the assay may be carried out in a single well of a microtitre plate. Assay 
formats which allow high throughput screening are preferred. 

10 Modulator activity can be determined by contacting cells expressing a 

dormancy-induced or a stationary-phase induced protein with a substance under 
investigation and by monitoring an effect mediated by the protein. The protein may 
be naturally or recombioantty expressed. Preferably, the assay is carried out in vitro 
using cells expressing recombinant protein. Preferably, control e3q)eriments are 

15 carried out on cells which do not express the protein of interest to establish whether 
the observed responses are the result of activation of the protein. Typically the cells 
are transfected with a Mycobacterium and growri in oxygen defficient conditions. 
More preferably, the cells are mycobacterial cells grown in conditions suitable for 
hypoxic growth, conditions that induce the hypoxic stationary phase or conditions 

20 that induce the stationary phase in the presence of oxygen. • 

. The binding of a test agent to a dormancy-induced or a stationary-phase 
induced protein can be determined directiy. For example, a radiolabelled test agent 
can be incubated with a dormancy-induced or a stationary-phase induced protein and 
binding of the test agent to the protein can be monitored. 

25 Agents Ihat inhibit the interaction of Rv3 133c or a variant thereof with DNA 

or with a sensor histidine protein kinase may also be identified through a yeast 2^ 
hybrid assay or other protein interaction assay such as a co-immunoprecipitation or 
an ELIS A based technique. 

Assays may be carried out using cells expressing Rv3 133c, and incubating 

30 such cells with the test agent optionally in the presence of a sensor histidine protein 
kinase Rv3 132c binding of Rv3 133c to the sensor histidine protein kinase may be 
determined. Alternatively, binding of Rv3 133c to DNA or transcriptional activity 



wo 02/48391 



PCT/EPOl/14551 



, . 17 

may be momtored. The results of the assay are ^ically compared to the results 
obtained using the same assay in the absence of the test agent. 
Monitoring expression 

Assays may also be carried out to idsatify agents which modijfy Rv3 133c, 
5 Rv2623 or Rv2626c ejqpression, for example agents which up- or doAvn- regulate 
. expression. Such assays may be carried out for example by using antibodies for 
Rv3133c, Rv2623 or Rv2626c to monitor levels of expression following induction of 
dormant condition. Alternatively, expression may be momtored by deteimining the 
effect of the test agent on Rv3 1 33c, Rv 2623 or Rv2626c mRNA levels. Preferably, 
10 mRNA levels are monitored in a Mycobacterium grown in oxygen-deprived 
conditions. 

Additional control ^qierimeats may be carried out 
Test agents ' . 

Suitable test agents which can be tested in the above assays include 

1 3 combinatorial libraries, defined chemical entities and compounds, peptide and 

peptide mimetics, oligonucleotides and natural product libraries, such as display (e.g. 
phage display libraries) and antibody products. . 

Typically, organic molecules will be screened, preferably small organic 
molecules which have a molecular weight of ftom 50 to 2500 daltons. Candidate 

20 products can be biomolecules including, saccharides, fetty acids, steroids, purines, 
pyrimidines, derivatives, structural analogs or combinations thereof Candidate 
agents are obtained from a wide variety of sources including libraries of synthetic or 
natural compounds. Known pharmacological agents may be subjected to directed, or 
random chemical modifications, such as acylation, alkylation, esterification, 

25 amidification, etc. to produce structural analogs. 

Test substances may be used in an initial screen of, for example, 10 
substances per reaction, and the substances of these batches which show inhibition or 
activation tested individually. Test substances may be used at a concentration of 
firom InM to 1 pOO(xM, preferably fixim l^M to lOOuM, more preferably from 1 nM 

30 tolOpM. / . 
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Diagnostic agent 

Diagnostic methods for the detection of nucleic acid molecules, such as 
mRNA, encoding a polypeptide of the invention may involve their amplification, e.g. 
by PGR (tiie experimental embodiment set forth in U.S, Patent No. 4,683,202), ligase 

5 chain reaction (Barany, 1991, Proc. Nati. Acad. Sci. USA 88:1 89-193), self sustained 
sequence replication (GuateUi et al, 1990, Proc. Nati. Acad. Sci, USA 87:1874- 
1878), transcriptional amplification system (Kwoh et aL, 1989, Proc. Nad. Acad. Sci. 
15 USA 86:1 173-1 177), Q-Beta Replicase (LiTardi et aL, 1988, Bio/Technology 
6:1 197) or any other nucleic acid amplification method, followed by the detection of 

10 the amplified molecules using techniques well known to those of skill in the art. 
These detection schemes are especially usefial for the detection of nucleic acid 
molecules if such molecules are present in very lownumbers. 
Medical uses 

Another aspect of the present invention is the use of the agents that have been 

15 identified by screening techniques referred to above in the treatment of mycobacterial 
infection. The treatment may be therapeutic or prophylactic. The condition of a 
patient suffering from a mycobacterial infection can thus be improved. The patient 
naay be a human or animal subject. Generally the animal subject is a mammal, 
typically one which can be naturally or artificially infected by a Mycobacterium. The . 

20 hxmian or animal subject may be a primate, cow or badger. The subject is typically >. 
one which can be naturally or artificially infected by a Mycobacterium. The subject 
may be at risk of a mycobacterial infection, typically because it is resident in a 
location in which mycobacterial infection is endemic. The subject may be 
siisceptible to mycobacterial infection due to malnutrition or infection by other 

25 pathogens, such as HIV. The Mycobacterium is typically pathogenic and capable of. 
infecting mammals, such as those mammals discussed above. The Mycobacterium is 
typically M. tuberculosis, Mmariman, Mkansasih M bovis or M.avium. 

Substances identified according to the screening ihethods outlined above may 
be fonnulated with standard pharmaceutically acceptable carriers and/or excipients as 

30 is routine in the pharmaceutical art. For example, a suitable substance may be 
dissolved in physiological saline or water for injections. The exact nature of a 
formulation will depend upon several factors including the particular substance to be 
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administered and the desired route of administration. Suitable types of formulation 
are fully described in Remington's Pharmaceutical Sciences, Mack Publishing 
Company, Eastern Pemisylvania, 17* Ed, 1985, the disclosure of which is included 
herein of its entirety by way of reference. 

5 A vaccine composition typically also comprises an adjuvant The adjuvant 

may target the peptide to antigen presenting ceUs (APCs) or to compartments in the 
antigen processing pathway, for example acting as a carrier protein. The sequence 
may stmiulate a T helper response, such as a response that favours a CDS T cell 
response, and thus may comprise a T helper (e.g. Thl) cell epitope. 

10 Preferred substances which enhance immunogenicity include sequence from 

the hepatitis B core antigen, sequence from a stress* protein or sequence from 
Clostridium tetani neurotoxin fragment C. The stress protein is typically a bacterial 
(e.g. mycobacterial) heat shock protein (HSP) or a protein which has* homology with 
jsuch a protein, such as mycobacterial or E. coli proteins of the HSP 60 and HSP 70 

.15' Ifamilies (e.g. HSP 65 or HSP 71 of mycobacteria) or mammalian homologue (e.g. 
gp96 of mice or humans, Anthony et al (1 999) Vaccine 17, 373-83). 

The substance may caiise the polypeptide or vector to adopt a particulate 
1 form. The substance may be a virus or vhnis-like particle (such as a yeast Ty particle, 
e.g. as in Allsopp et al (1996) Eur. J, Immunol. 26, 1951-9). The substance may be a 

20 cytokine, such as a cytokine which stimulates a MHC class I restricted T cell 

jresponse or favourable MHC class 11 resticted T cell response (e.g. IL-2, IL-7, IL- 
12, IFN or GMCSF). The substance may be, for example, CFA, a muramyl dipeptide 
(e.g. of a mycobacterial cell wall), monophosphoryl Upid A, lipopolysaccharide (e.g. 
from B. abortus), liposomes, SAF-1, a saponin (e.g. Quil A), keyhole- limpet 

25 hemocyanin, beta 2-microglobulin, mannan (e.g. oxidised mannan), an acrylic based 
microbead, or an emulsion (e.g. oil in water or water ia oil) such as soybean 
emulsion (e.g as in Hioe et a/." (1996) Vaccine* 14, 412-8)/ ' - • ~ - - - 

\ The substances may be administered by enteral or parenteral routes such as 
via oral, buccal, anal, pulmonary, intravenous, intra-arterial, mtramuscular, 

30 intraperitoneal, topical or other appropriate admimstration routes. 

The particular route of administration used may aid the stimulating of a CDS 
T cell response, and thus the polypeptide vector may be provided in a fonn suitable 
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for adininistering by such a route. Delivery by an intramuscular route or by biolistic 
means is preferred. . 

A therapeutically effective amount of a modulator is adminisftered to a patient 
sxiffering from a mycobacterial infection. An effective amount of a polypeptide of 

5 the invention, or a fragment thereof, capable of generating an immune response is 
administered to a subj ect at risk of a mycobacterial infection. Stimulation of an 
immuae response can typically be monitored by detecting antibodies directed against 
Rv3133c, Rv2623 or Rv2626c. Antibodies specific for such proteins may be 
detected using an ELISAassay. 

10 The dose of a modulator may be determined according to various parameters, 

especially .according to the substance used; the age, weight and condition of the 
patient to be treated; the route of administration; and the required regimen. A 
physician will be able to detercEiine the required route of administration and dosage 
for any particular patient. A typical daily dose is from about 0. 1 to 50 mg per kg of 

is ' body weight, according to the activity of the specific modulator, the age, weight and 
conditions of the subject to be treated, the type and severity of the degeneration and 
the frequency and route of administration. Preferably, daily dosage levels are from 5 
mg to 2 g. 

A vaccine composition is preferably adnodnistered in a single dose. One or 
20 more, for example, two, three or four, further doses may be required for long term 
protection against a Mycobacterium infection. Further doses may be against a 
Mycobacterium vakiCXion. Further doses may be administered after a period of from 
1 month to 15 years after the initial dose, for example, 1, 2, 3, 4, 5, 8, 10, 12 or 15 
years. Several fiirther doses may be administered at intervals after the initial dose, 
25 for example at 3 , 5, 1 0 or 1 5 year intervals. 

Nucleic acid encoding a polypeptide of the invention may be administered to 
a mammal as a nucleic acid vaccine. Nucleic acid encoding the polypeptide may be 
administered by any available technique. For example, the. nucleic acid may be 
introduced by needle injection, preferably intradermally, subcutaneously or 
30 intramuscularly. Alternatively, the nucleic acid may be delivered directly across the 
skin using a nucleic acid delivery device such as particle-mediated gene delivery. 
The nucleic acid may be administered topically to the skin, or to mucosal sinfaces for 
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eacample by intranasal, oral, intravaginal or intrarectel administration. 

Uptake of nucleic acid constructs may be enhanced by several known 
transfection techniques, for example those including the use of transfection agents. 
Examples of these agents includes cationic agents, for example, calcium phosphate 
5 and DEAE-Dextran and lipofectants, for example, lipofectam and transfectam. The 
dosage of the nucleic acid to be administered can be altered. Typically the nucleic 
acid is administered in the range of Ipg to Img, preferably to Ipg to lOiig nucleic 
acid for particle mediated gene deUvery and 10|j.g to Img for.other routes. 

Generally a low dose of antigen favours the development of a CDS T cell 
10 response.. Thus a suitable low dose qf the polypsptide or vector may be administered, 
to prevent mycobacterial infection. The polyJ)eptide or vector may thus be in an 
amount and concentration that is suitable for administering to provide an appropriate 
low dose. In a preferred embodiment the vector is administered in the form of 
"naked DNA". 

15 j The following Examples illustrate the inventioa 

Example 1; Detection and identijBcation of dormancy-indnced proteins. 

BCG were grown in the Wayne dormancy culture system. Bacilli were 
harvested at various time points (Fig. 1 , arrows A-D), washed twice in phosphate- 
buffered saline, resuspended in lysis buffer (9 M urea, 4% CHAPS, 50 mM DTT, 
pefabloc [1 mg ml"'], pepstatin [1 ^ig ml''], leupeptin [1 |.ig ml"'] and disrupted with 
0.5 mm glass beads using a Mini Bead Beater (Biospec). Protein concentrations 
were determined using the BioRad protein assay reagents and protocols. 1 GO \ig of 
total protein were subjected to isoelectric focusing using pH 4-7 Immobiline Dry 
Strips and an IPGphor isoelectric focusing unit as recommended by the manufecturer 
(Amersham Pharmacia) for 62000 Vhrs. For the separation in the second dimension 
12.5% sodium dodecyl sulfete polyacrylamide gels were used (Protean Ilxi system, 
BioRad) and proteins were detected by silver staining. Fig. 2. shows a representative 
set of two-dimensional gels, Four proteins showed a drastic increase in then- steady 
state level in the hypoxic stationary phase (Fig. 2, arrows 1-4). The proteins were not 
(arrow 1, 2, 4) or only weakly (arrow 3) detectable in the extracts from exponentially 
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growing cultures (Fig. 2A) and appeared as major spots immediately upon oxygen- 
starvation induced termination of growth (Fig, 2B). Elevated levels of the proteins 
were maintained throughout the hypoxic stationary phase (Fig. 2C, D). To identify 
the dormancy-induced proteins, the excised gel spots were subjected to in-gel 

5 digestion with trypsin to recover the peptides (Shevchenko et ah (1996) Anal. Chem. 
68: 850-858). Peptide sequence tags were generated from selected peptides by 
. nanoelectrospray tandem mass spectrometry using a quadrupole/time-of-flight hybrid 
instrument (QSTAR, PE Sciex). Protein identity was revealed by searchiag sequence 
databases with a combination of the peptide sequence tags and the mass information. 

10 Protein 1 was the 1 6 kD antigen Rv203 1 c previously reported to be induced in 
oxygen-starved cultures of tubercle bacilli (Yuan et al (1996) J. Bacteriol. 178: 
4484-4492). Protein 2 was the 23 kD response regulator Rv3 1 33cc. Proteins 3 and 4 
were the. 32 kD conserved hypothetical protein Rv2623 and the 16 kD (14kD on SDS 
PAGE) conserved hypothetical protein Rv2626c, respectively (Fig. 3). Protein 

15 names and Rv numbers are according to the M. tuberculosis H37Rv genome 
annotation (Cole et al (1 998) Nature 393 : 537 - 544). 

Example 2 1 Transcript levels of dormancy-induced protfeins. . 

To determine whether the increase of the steady state level of tiie dormancy- 
20 induced proteins corrielates with an increase in the steady state level of their mRNAs, 
total RNA was isolated from exponentially growing and hypoxic stationary phase 
cultures and subjected to Northern blot analysis as described (Hutter and Dick (1999) 
FEMS Microbiol. Lett. 178; 63-69). Probes were isolated by PGR using BCG 
genomic DNA as template. The primers were derived from the M tuberculosis 
25 H37Rv genome sequence and were as follows: 1, Rv2031c 
(GCCACCACCCTTCCCGTTCAG, ATGTCGTCCTCGTCAGCACCTACC); 2, 
Rv3133cc (TCGTAGGTGTAGGCGGGTTC, CGGCGATCTGGTTGTTGGT); 3, 
Rv2623 (GGCAGCCGTTCCCACATTG, GGCTGATCGCGCACCACCAC); 4, 
Rv2626c (CCACCGCACGCGACATCAT, CGGAACACGGCGGACCTG); 16S 
30 rRNA (GCCTGGGAAACTGGGTCTAA, TCTCCACCTACCGTCAATCC). The 
identity of the PGR fragments was confirmed by sequencing using^ a Perkin-Elmer 
ABI Prism 377 automated sequencer. Fig. 4 shows higb levels of the transcripts for 
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aU four proteins in dormant badUi. In exponential growing culture the transcripts 
were not detectable or were only weakly detectable. This result indicates that the 
expression of the proteins is regulated at the transcriptional level. 

The up-regulation of the steady state level of the 16 kD antigen mRNA in 

5 oxygen-starved dormant culture is in ^parent contradiction to the down-regulation 
of the transcript in nonpagitated cultures of the tubercle bacillus observed by Hu et 
a/.(1999)J.Bacteriol. 181: 1380-1387. The 16 kD antigen had been previously 
reported to be induced in oxygen-starved stationary phase cultures of tubercle bacilU. 
The protein belongs to the 20 kD smaU heat shock protein family (Fig. 3) and was 

0 shown to possess chaperon function ascribed to other members of the family. The 
three new donnancy-induced proteins were predicted by the M tuberculosis H3 7Rv 
genome project (Cole et al). However, biochemical or genetic data for these proteins 
are not available. Two of the three new donnancy-induced proteins are annotated as 
'conserved hypothetical proteins' (Cole et al. (1998) Nature 393: 537-544), A 

5 similarity search against the protein domain femiUes (pfem) database (1) suggested 
that tiie 32 kD conserved hypothetical protein shown in SEQ ID NO: 4 contains two 
'Universal Stress Protein' (Usp) domains (Fig. 3). This domain is found m the UspA 
protein in E. coll UspA is iq)-regulated m stationary phase cultures and plays a role 
in the survival of growth-arrested ceUs (Diez et al. (2000) Mol. Micobiol. 36: 1494- 

3 1 503). The 1 6 kD conserved hypothetical protein shown in SEQ ID NO: 6 is 

predicted to contain two 'Cystathionine-beta synthase' (CBS) domains (Fig. 3). CBS 
domains are found m functionally diverse proteins. A function for this domain is not 
known. Most intriguing is the dormancy-dependent up-regulation of the 23 kD 
response regulator shown in SEQ ID NO: 2. This response regulator contains a 
'helix-tum-helix' DNA binding domain and is thus lilcely to act as a phosphorylation- 
dependent transcription factor (Fig. 3). Response regulators, together with then 
respective sensor histidine protein kinase, are part of two-component signal 
transduction systems and play key roles in a variety of developmental and adaptive 
processes in bacteria. Thus, it is conceivable that the 23 kD response regulator plays 
a role in the control of the mycobacterial dormancy response. Inspecticai of the 
genomic locus surrounding the goie encoding the 23 kD response regulator in M. 
tuberculosis H37Rv showed that the gene spears to form an operon with the gene 
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Rv3 132c annotated as putatiye sensor histidine protein kinase (Cole et at). The 
genomic organisation is identical in BCG and could indicate that this putative kinase 
represents the sensor involved in the control of the activity of the dormancy-induced 
23 kD response regulator. 
5 . ' • 

Example 3: Detection and identification of stationary phase - induced proteins 

M bovis BCG were grovm at 37°C in Dubos Tween-albuniin broth (BD 
Biosience). Bacilli were sub-cultured once in liquid medium until they reached an 
early exponential growth phase (Agoo = 0-2 • 0.3) before inoculation to an 
10 experimental culture. To grow experimental cultures medium was dispensed in 100 
ml aliquots to roller bottles, 10 x 14 cm, and pre-cultures were diluted to A^qo ^ 0.05 
(5x10^ cfu/ml). Cultures were aerated by incubation on a roller apparatus at 1 ipm. 
The roller bottles were opened daily for turbidity measufemeiits and to allow 
exchange of the air. Growth was monitored by measuring the optical density of the 
15 ■ cultures ia a Ultrospec 3000 spectrophotometer (Pharmacia). 

Protein extracts were prepared from different time points corresponding to the 
exponential pheise, entry into stationary phase, early and late stationary phase (Fig. 5, 
arrows A-D). For the preparation of protein extracts the cells were washed twice in 
phosphate-buffered saline, resuspended in lysis buffer (9 M urea, 4% CHAPS, 50 
20 . mM DTT, pefabloc [1 mg ml'^], pepstatin [1 jxg ml*^], leupeptia [1 jxg m;"^]) and 
disrupted with 0.5 mm glass beads using a Mini Bead Beater (Biospec). Protein 
concentrations were determined using the BioRad protein assay reagents and 
protocols. 1 00 {xg of total protein was subjected to isoelectric focussing using pH 4-7 
Immobiline Dry Strips and a IPGphor isoelectic focusing unit as recommended by 
25 the manufacture (AmershamPharmacia) for 62000 Vhrs. For the separation in the 
second dimension 12.5% sodiuni dodecyl sulfate polyacrylamide gels were used 
(Protean Ilxi system, BioRad). Proteins were. detected by silver staining and the gels 
were evaluated by visiial iospection. Fig. 6 shows a representative set of two- 
dimensional gels. Two proteins of 16kD and 14kD showed a drastic increase in their 
30 steady state level in the stationary phase (Fig. 6, arrows 1 and 2). The two proteins 
were not detectable in the extracts from expoentially growing cultures (Fig. 6A) and 
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appeared as major spots immediately upon termination of growfli (Fig. 6B). Elevated 
levels of tiie proteins were maintained througjiout tiie stationary phase (Fig. 6C, D). 
The 16 kD protein is the BCG counterpart of the 16 kD antigen (Rv2031c). The 14 
kD polypeptide is the counterpart of the protein Rv2626c predicted by the M 
5 tuberculosis H37Rv genome project. 

The marked up-regulation of the 16 kD antigen and the 14 kD protein 
immediately upon termination of growth suggests that they play roles in the entry 
into the stationary phase and/or in maintaining viability of stationary phase bacilli. 
The 1 6 kD antigen has been previously reported to be induced in stationary phase 
10 cultures of tubercle bacilli (Yuan el aL (1996) J. Bacteriol. 178: 4484-4492). 

Example 4: Homologues of the stationary phase-induced and dormancy-mduced 
Rv2626c protein 

Biochemical or genetic data for the new stationary phase-induced 14 kD 

15 protein/dormancy-induced 16 kD protein shown in SEQ ID NO:6 are not available. 
A similarity search against the protein domain families (pfam) database suggested 
that the protein consists of a pair of CBS domains (Figs. 3 and 7). The recently 
discovered CBS domain is named after Cystathionine Beta Synthase where it was 
originally identified. The doniiain is usually present as a pair and this CBS domain- 

20 dimer associates to form a single compact structure. Pairs of CBS domains are foxmd 
in a large number of functionally diverse proteins such as inosihe-monophosphate 
dehydrogenases and chloride channels. Although the role of the CBS domain in 
these proteins is unclear, it may be involved in protein-protein interaction and protein 
regulation. In contrast to these proteins that contain a pair of CBS domains in the 

25 context of other, unrelated domains, the small stationary phase-induced 

mycobacterial protein appears to consist of only a pair of CBS domains, not fused to 
other domains (Figs. 3 and 7). It is interesting to note that small molecular weight 
proteins with the same predicted domain architecture and about 35% sequence 
identity can be found in other bacteria such as Streptomyces coelicolor, Pseudomas 

30 aeruginoas and Bacillus subtilis (Fig, 7). This suggests that the Rv2626c dormancy- 
induced and stationary phase-induced mycobacterial protein is a member of a new 
bacterial protein family. 
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Example 5; Over expression, purffication and phosphorylation of the 23kP 
response regulator protein. 

The donnancy-induced response regulator Rv3 133c is predicted to function 
5 as a phosporylation-dependent transcription regulator. Thus, it is likely that the 
protein binds to target promoters and recruits KNA polymerase to initiate 
transcription of essential dormancy jgenes. Hence, prevention of DNA binding by 
inhibitors should abolish response regulator function. Under the assumption that a 
cognate promoter sequence is available, an in yitro DNA binding assay can be carried 
10 out to screen for inhibitors of the response regulator's DNA binding activity: 

Using gene specific primers derived form the M. tuberculosis H37Rv genome 
sequence (Cole et aL (1 998)) the coding sequence of the Rv3 1 33c gene is isolated by 
PGR and a glutathione S-transferase fusion protein is constructed, overexpressed and 
purified as described in Ulijasz et aL (2000), Biochemistry 39, 11427-11424 and 
15 ■ Sambrook et cd. (1 989) Molecular cloning. Cold Spring Habor Laboratory Press, 
Cold Spring Habor, N,Y. The purified protein is phosphorylated as described in 
Ulijasz et aL nsing 50 mM lithium acetyl phosphate. 

Example 6: Isolation and radioactive labelling of the promoter DNA " - 

20 The cognate promoter DNA is PCR amplified using primers derived firom the 

M. tuberculosis H37Rv genome sequence (Cole aL (1998)) and 5'-end labelled . 
with y-^^P-ATP and T4 polynucleotide kinase according to standard methods as 
described in Sambrook ef a/, (1989), 

25 Example 7: Binding of the phosphorylated response regulator to its cognate 
promoter and gel shift analysis of the binding 

Protein-DNA complexes are formed in a total volume of 1 5 |xl which contain 
4 pmol of phosphorylated response regvdator which was preincubated with the 
. {appropriate inhibitors for 10 min at room temperature in 20 mM HEPES (pH 7.2), 5 
30 rcM MgClj, 0.1 eqM NaaEDTA, 0.5 mM CaClj, 10% glycerol and 0.5 of salmon 
DNA as described in Ulijasz et al. (2000). To start the reaction, "P-labeled promoter 



wo 02/48391 



PCT/EPOl/14551 



27 

DNA (0.65 ng in lul) is added, and the reaction mixture is incubated on ice for 15 
min. 



The resultant complexes are analyzed by non-denaturing polyaaylamide gel 
electrophoresis (PAGE). Gels are dried and exposed to a phosphoiimager as 
described in Ulijasz et al. (2000). 

An inhibitor of the response regulator wiU prevent the binding of the protein 
to its promoter and hence reduce complex formation of the two components. 
Complex formation, or the absence of complex formation is detectable in flie gel shift 

analysis. . * 
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g^AIMS 

1 . A method for the identification of an anti-mycobacterial ^ent that 

5 modulates the activity and/or expression of a protein expressed by a Mycobacterium 
in non-oxygen limiting stationary phase, hypoxic stationary phase or hypoxic growth 
phase, which method comprises: 

(i) contacting a test agent and a protein selected fi:omRV3 133c, Rv2623, 
Rv2626c, a variant of Rv3 133c, Rv2623 or Rv2626c and a fragment 

10 of Rv3 133c, RV2623, Rv2626c or said variant, or a polynucleotide or 

expression vector encoding said protein; 

(ii) monitoring the effect of the test agent on the activity and/or 
e3q)ression of said protein, thereby determining whether the test agent 
modulates the activity and/or ejq^ression of a protein expressed by a 

15 . Mycobacterium in non-oxygen limiting stationary phase, hypoxic 

stationary phase or hypoxic growth phase. 

2. A method according to claim 1 whereih step (ii) composes monitoring 
binding of said protein to the test agent. 

3 . A method according to claim 1 wherein step (ii) comprises monitoring 
20 binding of Rv3133c, a variant thereof or a fragment of either thereof to DNA. 

4. A method according to claim 1 wherein step (ii) comprises monitoring 
binding of Rv3 133c, a variant thereof or a fragment of either thereof to a sensor 
histidine protein kinase. 

5 . A method according to claim 4 wherein the sensor histidine protein 
25 kinase is Rv3 1 32c, a variiant thereof or a fragment of either thereof 

6. A method according to claim 1 whereia step (ii) comprises 
monitoring the transcriptional activity of a gene regulated by Rv3 133c, a variant 
thereof or a fragment of either thereof. 

7. A method for the identification of a diagnostic agent that binds to a 

30 protein expressed by a Mycobacterium in non-oxygen limiting stationary phase, 
. hypoxic stationary phase or hypoxic growth phase, or to a polynucleotide encoding 
said protein, which method comprises: 

(i) contacting a test agent and a protein selected from Rv3 133c, Rv2623, 
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10 



Rv2626c, a variant of Rv3133c, Rv2623, Rv2626c or a fragment of 
Rv3133c. RV2623, Rv2626c or said variant, or apolynudeotide 
encoding said protein; 
(ii) monitoring any interaction between tiie test agent and said protein or 

said polynucleotide, thereby detennining whether the test agent binds 
aprotein or polynucleotide expressed by ^ Mycobacterium mnon- 
oxygen limiting stationary phase, hypoxic stationary phase or hypoxic 
growth phase. 

8. A method-according to any preceding claim wherein said agent is a 
variant or fragment of a protein expressed by a Afycobacterium in non-oxygen 
limiting stationary phase, hypoxic stationary phase or hypoxic growth phase, a 
polynucleotide which encodes a variant or fragment of said protein or a " 
polynucleotide which hybridises under stringent conditions to a sequence eticoding 
said protein. 

15 .9. An agent which is identifiable by a method according to any preceding 

claim. 

10. An agent accorcHng to claim 9 v*ich inhibits activity and/or 

expression of aprotein expressed by a J\^co6acfert«m mnonH3xygenlimiting 
stationary phase hypoxic stationary, phase or hypoxic growth phase. 

20 11. • An antibody specific for aprotein selected fromRv3133c, Rv2623 or 

Rv2626c. 

12. A pharmaceutical composition comprising a pharmaceuticaUy 
effective, carrier and as an active ingredient an effective amount of an agent according 
to claim 9 or 1 0 or an antibody accordiag to claim 1 1 . 

25 13. A vaccine composition comprising as an active ingredient an effective 

amount of a protein selected from Rv3 133c, Rv2623. Rv2626c and a variant of any 
thereof, or an immunogenic fragment any said protein, and a phannaceuticaUy 
effective carrier. 

14. An agent according to claim 9 or 10, an antibody according to claim 
30 11, a phannaceutical composition according to claim 12 or a vaccine composition 
according to claim 13 for use in a method of treatment of the human or axrimal body 
by therapy or in a diagnostic method practised on the human or animal body. 
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15. Use of an agent according to claim 9 or 10, an antibody according to 
claim 1 1 , a pharmaceutical composition according to claim 12 or a vaccine 
composition according to claim 13 in the manufacture of a medicament for the 
diagnosis, prophylaxis or treatment of a mycobacterial infection. 
5 16. Use accordiag to claim 15 wherein said mycobacterial infection is 

tuberculosis. 

1 7. * A method of treating a subject STiffering from a mycobacterial 
infection, which method comprises administering to said subject a therapeutically 
effective amount of an agent according to claim 9 or 10, an antibody according to 

10 claim 11 or a pharmaceutical composition according to claim 12. 

18. A method for preventing a mycobacterial infection in a subject at risk 
thereof, which method comprises admioistrating to said subject a prophylactically 
effective amoimt of a protein selected from Rv3133c, Rv2626c, Rv2623, a variant of 
any thereof and a fragment of any said protein, an agent according to claim 9 or 1 0, 

15 an antibody according to claim 11 or a vaccine composition according to claim 13. 

19. A miethod for detecting a mycobacterial infection in a sample, which 
method comprises detecting the presence of a protein ejqjressed by a hdycobacterium 
ui non-oxygen limiting stationary phase, hypoxic stationary phase or hypoxic growth 
phase or a nucleic acid encoding said protem in said sample, wherein said protein is 

20 selected from Rv3 1 33c, Rv2623, RV2626c and a variant of any thereof. 

20. A method according to claim 1 9 which comprises: 

(i) contactuig a sample and an agent according to claim 9 or an antibody 
according to- claim 1 1; and 

(ii) monitoring binding of said agent or antibody to said sample, thereby 
25 determining whether said sample comprises a protein expressed by a 

Mycobacterium in non-oxygen limiting stationary phase, hypoxic 
stationary phase or hypoxic growth phase, or a nucleic acid encoding 
said protein and hence whether said sample is infected with a 
Mycobacterium, 

30 21 , A method according to claim 20 wherein said nucleic acid is RNA. 

22. An in vitro method of diagnosing a mycobacterial infection in a 
subject, which method comprises a method according to any one of claims 19 to 21 
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and wherein said sample is a sample from said subject 

23. An mvf/ro or i«vrvo method for diagnosing a mycobacterial infection 
in a subject whichmethod comprises monitoring expression of a protein expressed 
by aAr;;<:oteerf«w in non-oxygen limiting stationary phase, hypoxic stationary 

5 Pliase or hypoxic growth phase, wherein said protein is selected from Rv3133c, 
Rv2623, RV2626C and a variant of any thereof. 

24. A method according to claim 23, which method comprises 
administering to a subject at risk of mycobacterial infection an agent accordmg to 
claim 9 or an antibody according to clahn 1 1 and monitoring the response to the said 

10 agent or antibody. 

25. A method according to any one of claims 17 to 24 vdierein said 
mycobacterial infection is tuberculosis. 

26. A method of obtaining a protem selected from Rv3 133c, Rv2623, 
Rv2626c and a variant thereof, which method comprises maii^aining a ■ 
Mycohacterimn under aerobic or anaerobic conditions suitable for inducing non- 
oxygen limiting stationary phase, hypoxic stationary phase or hypoxic growth phase 
exjwessed proteins and isolating the said protein. 

27. Use of a protem selected from Rv3 133c, Rv2623, Rv2626c, a variant 
of Rv3 133c, RV2623 or Rv2626c and a fragment of Rv3133c, Rv2623. Rv2626c or 
said variant in a method for the identification of an anti-mycobacterial agent 

28. UseofaproteinselectedfromRv3133c,Rv2623,Rv2626c, avariant 
of Rv3133c, Rv2623 or Rv2626c and a fragment of Rv3133c, Rv2623, Rv2626c or 
said variant in a method for the identification of an agent for diagnosing a dormant 
mycobacterial infection. 
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5 <110> INSTITUTE OF MOLECULAR AND CELL BIOLOGY 
<120> DORMANCY- INDUCED MTCOBACTERIUM PROTEINS 



10 



15 



<130> P81039 GCW 



<160> 8 



<170> Patentin version 3.0 



20 



25 



30 



35 



40 



45 



50 



55 



60 



<210> 
<211> 
<212> 



1 

651 

DNA 



<213> Mycobacteriiom tuberculosis 



<220> 
<221> CDS 
<222>. -(1)., (651) 

■<400> 1 

gtg gta aag gtc ttc ttg gtc gat gac 
Val Val Lys Val Phe Leu Val Asp Asp 

ctg gtt gac ttg ctt ggg gcc gat ccc 
96 

Leu Val Asp Leu Leu Gly Ala Asp Pro 

20 . . 25 

gcg ggt teg gtc gcc gag gcg atg gcc 

Ala Gly Ser Val Ala Glu Ala Met Ala 
35 40 

gat gtc gcg gtg ctg gat gtc egg ttg 

Asp Val Ala Val Leu Asp Val Arg Leu 
50 55 

ctg tgc cgc gat ctg ttg tec cgc atg 

240 ^ 

Leu Cys Arg Asp Leu Leu Ser Arg Met 
65 70 

etc acg tec tac acc tct gac gag gcc 
288 

Leu Thr Ser Tyr Thr Ser Asp Glu Ala 
85 

ggt gcc age gga tat gtc gtc aaa gac 
336 ^ 
Gly Ala Ser Gly Tyr Val Val Lys Asp 
100 105 



cac gag gtg gtg cgt cgt ggt 



His Glu Val Val 
10 



Arg Arg Gly 
15 



gag ctt gac gtc gta ggt gag 
Giu Leu Asp Val 



agg gtt 
Arg Val 

ccc gat 
Pro Asp 

ccc gat 

Pro Asp 
75 



cct 
Pro 

ggc 

Gly 
60 

ctg 

Leu 



gcc 

Ala 
45 

aac 

Asn 

cgc 
Arg 



atg eta gat gcg 
Asp Ala 



Met Leu 
90 

ate aag 

lie Lys 



Val Gly Glu 
30 

gcg cgc cca 
Ala Arg' Pro 

ggc att gaa 
Gly He Glu 

tgt ctg ate 

Cys Leu He 
80 

att etc gcc 

He Leu Ala 
95 



gga atg gag ttg gcg 

Gly Met Glu Leu. Ala 
110 
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cgc gcc gtc aaa gat gtg ggc get gga egg teg ctg ctg gac aat egg 
384 • 

Arg Ala Val 'Lys Asp Val Gly Ala Gly Arg Ser Leu Leu Asp Asn Arg 
5 115 120 125 

gcc gcg gcc gcg ctg atg gcc aag ctg cgc ggt gcc gcc gag aagVcag 
432 

Ala Ala Ala Ala Leu Met Ala Lys Leu Arg Gly Ala Ala Glu Lys Gin 

10 130 135 ■ . 140 . . 

gac ccg eta tea ggc qtt acc gac cag gag egg acg eta ctg ggc ctg 
480 

Asp Pro Leu Ser Gly Leu Thr Asp Gin Glu Arg Thr Leu Leu Gly Leu 

15 145 150 155 160 

ett age gag ggc ctg ace aac aag cag ate gee gac. ega atg ttc eta 
528 

Leu Ser Glu Gly Leu Thr Asn Lys Gin lie Ala Asp Arg Met Phe Leu 
20 165 ■ . 170 175 

gcc gaa aag acg gtg aag aac tac gtg teg egg ttg ctg gcc aag ctg 
576 

Ala Glu Lys Thr Val Lys Asn Tyr Val Ser Arg Leu Leu Ala Lys Leu 
25 180 .185 190 

ggc atg gaa cgt egg acg caa gee gcg gta ttc gcg acg gag ttg aag 
624 

Gly Met Glu Arg Arg Thr Gin Ala Ala Val Phe Ala Thr Glu Leu Lys 
30 ' ■ ■ 195 200 . . 205 - 

cgc teg egg eca ccc ggt gat gga cea 
651 

Arg Ser Arg Pro Pro Gly Asp Gly Pro 
35 210 215 



<210> 2 

<211> 217 

40 <212> PRT 

<213> Mycobacterium tuberculosis 

<:400> 2 • 

45 Val Val* Lys Val Phe Leu Val Asp Asp His Glu Val Val Arg Arg Gly 
1 5 . 10 - 15 



Leu Val Asp Leu Leu Gly Ala Asp Pro Glu Leu Asp Val Val Gly Glu 
50 . 20 25 30 



Ala Gly Ser Val Ala Glu Ala Met Ala Arg Val Pro Ala Ala Arg Pro 
35 40 45 

55 

Asp Val Ala Val Leu Asp Val Arg Leu Pro Asp Gly Asn Gly He Glu 
50 .55 60 

60 ' " ^ L . i 

Leu Cys Arg Asp Leu Leu Ser Arg Met Pro Asp Leu Arg Cys Leu He 
65 70 .75 80 
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Leu Thr Ser Tyr Thr Ser Asp Glu All Met Leu Asp Ala He Leu Ala 

95 



85 90 



5 Gly Ala Ser Gly Tyr Val Val Lys Asp He Lys Gly Met Glu Leu Al 
■^00 105 110 



a 



Arg Ala Val Lys Asp Val Gly Ala Gly Arg Ser Leu Leu Asp Asn Arg 
• 120 125 



15 



20 



Ala Ala Ala Ala Leu Met Ala Lys Leu Arg Gly Ala Ala Glu Lys Gin 

135 140 

Asp Pro Leu Ser Gly Leu Thr Asp Gin Glu Arg Thr Leu Leu Gly Leu 

•^^^ 155 160 

Leu Ser Glu Gly Leu Thr Asn Lys Gin He Ala Asp Arg Met Phe Leu 
165 170- 3^75 



25 Ala Glu Lys Thr Val Lys Asn Tyr Val Ser Arg Leu Leu Ala Lys Leu 
180 185 190 

Gly Met Glu Arg Arg Thr Gin Ala Ala Val Phe Ala Thr Glu 'Leu Lys 
^ 200 205 

Arg Ser Arg Pro Pro* Gly Asp Gly Pro 
210 215. 

35 

<210> 3 - 
<211> 891 
<212>- DNA 
40 <213> Mycobacterium tuberculosis 

. <220> 
<221> CDS 
45 <222> (1)..(891) 

<400> 3 

50 48^ ^""^ ^""^ ^^"^ tct-ctg gga att ate gtc ggg ate gac gat 

Met Ser Ser Gly Asn Ser Ser Leu Gly He He Val Gly He Asp Asp 

55 IT ^""^ ^""^ ^""^ get egg gat gcg gag 

Ser Pro Ala Ala Gin Val Ala Val Arg Trp Ala Ala Arg Asp Ala Glu 
^0 25 30 



60 144 



ttg cga aaa ate cet ctg acg etc gtg cac gcg gtg teg ecg gaa gta 

Leu Arg Lys He Pro Leu Thr Leu Val His Ala Val Ser Pro Glu Val 
• -^^ 40 45 
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15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



4. 

gcc acc tgg ctg gag gtg cca ctg ccg ccg ggc gtg ctg cga tgg cag 
192 

Ala Thr Trp Leu Glu Val Pro Leu Pro Pro Gly Val Leu Arg Trp Gin 
50 55 .60 

cag gat cac ggg cgc cac ctg ate gac gac gca etc aag gtg gtt gaa 
240 

Gin Asp His Gly Arg His Leu lie Asp Asp Ala Leu Lys Val Val Glu 
65 70 .75 80 

cag get teg ctg cgc get ggt ecc ccc aeg gte cac agt gaa ate gtt 
288 

Gin Ala Ser Leu Arg Ala Gly Pro Pro Thr Val His Ser Glu lie Val 
85 90 95 

ccg gcg gca gcc gtt ccc aca ttg gtc gac atg tec aaa gac gca gtg 
336 

Pro Ala Ala Ala Val Pro Thr Leu Val Asp Met Ser Lys Asp Ala Val 
100 105 110 

ctg atg gtc gtg ggt tgt etc gga agt ggg egg tgg ccg ggc egg ctg 
384 ' 

Leu Met Val Val Gly Cys Leu Gly Ser Gly Arg Trp Pro Gly Arg Leu 
115 120 125 • . 

etc ggt teg gte agt tec ggc ctg etc cgc cac gcg cac tgt ccg gtc 
432 

Leu Gly Ser Val Ser Ser Gly Leu Leu Arg His Ala His Cys Pro Val 
130 135 140 

gtg ate ate cac gac gaa gat teg gtg atg ccg cat ccc cag caa gcg 
480 

Val lie lie His Asp Glu Asp Ser Val Met Pro His Pro Gin Gin Ala 
145 150 155 160 

ccg gtg eta gtt ggc gtt gac ggc teg teg gcc tec gag ctg gcg acc 
528 ' . 

Pro Val Leu Val Gly Val Asp Gly Ser Ser Ala Ser Glu Leu Ala Thr 
165 .170 175- 

gca ate gca ttc gac gaa gcg teg egg cga aac gtg gac ctg gtg gcg 
576 

Ala lie Ala Phe Asp Glu Ala Ser Arg Arg Asn Val Asp Leu Val Ala 

• 180 185 ■ 190 

ctg cac gca tgg age gac gtc gat gtg teg gag tgg ccc gga ate gat 
624 . 

Leu His Ala Trp Ser Asp Val Asp Val Ser Glu Trp Pro Gly lie Asp 

195 200 205 

tgg ccg gca act cag teg atg gcc gag cag gtg ctg gcc gag egg ttg 
672 

Trp Pro Ala Thr Gin Ser Met Ala Glu Gin Val Leu Ala Glu Arg Leu 
210 215 220 

gcg ggt tgg cag gag egg tat ccc aac gta gcc ata acc cgc gtg gtg 
720 

Ala Gly Trp Gin Glu Arg Tyr Pro Asn Val Ala lie .Thr Arg Val Val 
225 230 235 240 

gtg cgc gat cag ccg gcc cgc cag etc gtc caa cgc tec gag gaa gcc 
768 

Val Arg Asp Gin Pro Ala Arg Gin Leu. Val Gin Arg Ser Glu Glu Ala 
245 250 "255 
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cag ctg. gtc gtg gtc ggc age egg gge cgc ggc ggc tac gee gga atg 

Gin Leu Val Val Val Gly Ser Arg Gly Arg Gly Gly Tyr Ala Gly Met 
<oO 265 270 

ctg gtg ggg teg gta ggc gaa ace gtt get cag ctg gcg egg acg ceg 

Ala 
285 



Leu Val Gly Ser Val Gly Glu Thr Val Ala Gin Leu Ala Arg Thr Pro 

275 280 



gtc ate gtg gca cgc gag teg ctg act 
891 

Val lie Val Ala Arg Glu Ser Leu Thr 

15 290 295 

<210> \ . * ■ 

<211> 297 . ■ 

20 <212> PRT 

<213> Mycobacteri^a2a tiaberculosis 

<400> 4 - 

Met Ser Ser Gly Aan Ser Ser Leu Gly lie He Val Gly He Asp Asp 

. ^ ; 10 is 

30 Ser Pro Ala Ala Gin Val Ala Val Arg Trp. Ala Ala Arg Asp Ala Glu 

20 25 ... 30 • 

Leu Arg Lys He Pro Leu Thr -Leu Val His Ala Val Ser Pro Glu Val 
•^^ 35 . 40 45 



40 



45 



Ala Thr Trp Leu Glu Val Pro Leu Pro Pro Gly Val Leu Arg Trp Gin 
^0 55 60 

Gin Asp His Gly Arg His Leu He Asp Asp Ala 'Leu Lys Val Val Glu 

70 . 75 80. 

Gin Ala Ser Leu Arg Ala Gly Pro Pro Thr Val His Ser Glu He Val 
85 90 95 



50 Pro Ala Ala Ala Val Pro Thr Leu Val Asp Met Ser Lys Asp Ala Val 

■ ' 105 110 

Leu Met Val Val Gly Cys Leu Gly Ser Gly Arg Trp Pro Gly Arg Leu 
115 120 125 

Leu Gly Ser Val Ser Ser Gly Leu Leu Arg His Ala His Cys Pro Val 
60 140 . 

Val He He His Asp Glu Asp Ser Val Met Pro His Pro Gin Gin Ala 

"0 155 160 
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Pro Val Leu Val Gly Val Asp Gly Ser Ser Ala Ser Glu Leu Ala Thr 
165 170 175 

5 

Ala lie Ala Phe Asp Glu Ala Ser Arg Arg Asn Val Asp Leu Val Ala 
180 185 190 * 

10 Leu His Ala Trp Ser Asp Val Asp Val Ser Glu Trp Pro Gly lie Asp 
195 200 205 

Trp Pro Ala Thr Gin Ser Met Ala Glu Gin Val Leu Ala Glu Arg Leu 
15 210 215 220 

Ala Gly Trp Gin Glu Arg Tyr Pro Asn Val Ala lie Thr Arg Val Val 
225 ' 230 235 240 

20 • 

Val Arg Asp Gin Pro Ala Arg Gin Leu Val Gin Arg Ser Glu Glu Ala 
. 245 250 255 

25 

Gin Leu Val Val Val Gly Ser Arg Gly Arg Gly Gly Tyr Ala Gly Met 
260 265 270 

30 Leu Val Gly Ser Val Gly Glu Thr Val Ala Gin Leu Ala Arg Thr -Pro 
275 280 285 

Val lie Val Ala Arg Glu Ser Leu Thr 
35 290 . 295 

<210> 5 

<211> 429 

40 <212> DNA 

<213> Mycobacterium tuberculosis 

<220> 
45 <221> CDS 

<222> (1)..(429) 

<400> 5 

50 atg acc ace gca cgc gac ate atg aac gca ggt gtg acc tgt gtt ggc 
48 

Met Thr Thr Ala Arg Asp lie Met Asn Ala Gly Val Thr Cys Val Gly 
1 5 . 10 15 

55 gaa cac gag acg eta acc get gee .get eaa tac atg cgt gag cae gac 
96 

Glu His Glu Thr Leu Thr Ala Ala Ala Gin Tyr Met Arg Glu His Asp 
20 • 25 30 

60 ate ggc gcg ttg ceg ate tgc ggg gac gac gac egg ctg cac ggc atg 
144 

He Gly Ala Leu Pro He Cys Gly Asp Asp Asp Arg Leu His Gly Met 
35 40 45 
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15 



20 



25 



30 



50 



55 



7 

etc acc gac cgc gac att gtg ate aaa ggc ctg get gcg ggo ota gac 

Leu Thr Asp Arg Asp lie Val lie Lys Gly Leu Ala Ala Gly Leu Asp 
3" 55 60 

240 ^""^ '^'"^ ^'^^ 9^=° 9ac age ate tac tae 

Pro Asn Thr Ala Thr Ala Gly Glu Leu Ala Arg Asp Ser lie Tyr Tyr 

70 75 80 . 

gtc gat gcg aac gca ago ate cag gag atg. etc aac gtc atg gaa gaa 

Val Asp Ala Asn Ala Ser lie Gin Glu Met Leu Asn Val Met Glu Glu 
85 90 95 

c^t cag gtc cgc cgt gtt ccg gtc ate tea gag cac ego ttg gtc gga 

His Gin Val Arg Arg Val Pro Val lie Ser Glu His Arg Leu Val Gly 
100 105 110 

ate gtc acc gaa gcc gac ate gee cga cac ctg ccc gag cac gcc att 

lie Val Thr -Glu Ala Asp lie Ala Arg His Leu Pro Glu His Ala lie 
115 120 125 

gtg cag ttc gtc aag gca ate tgc teg cce atg gee etc gcc age 

Val Gin Phe Val Lys Ala lie Cys Ser Pro Met Ala Leu Ala Ser 
130. 135 140 



<210> 6 

<211> 143 

<212> PRT 

35 <213> Mycobacterium .tiaberculosis 

<400> 6 

40 Met Thr Thr Ala Arg Asp lie Met Asn Ala Gly Val Thr Cys Val Gly 
^ -5 10 . IS 

Glu His Glu Thr Leu Thr' Ala Ala Ala Gin Tyr Met Arg Glu His Asp 
^5 . 20 25 30 



He Gly Ala Leu Pro He Cys Gly Asp Asp Asp Arg Leu His Gly Met 
35 40 45 

Leu Thr Asp Arg Asp He Val He Lys Gly Leu Ala Ala Gly Leu Asp 
50 55 60 

Pro Asn Thr Ala Thr Ala Gly Glu Leu Ala Arg Asp Ser He Tyr Tyr 
^5 70 75 " ao 



60 Val Asp Ala Asn Ala Ser He Gin Glu Met Leu Asn Val .Met Glu Glu 

85 90 . . 95 . 



His Gin Val Arg Arg Val Pro Val He Ser Glu His Arg Leu Val Gly 
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8 

100 105 110 

lie Val Thr Glu Ala Asp He Ala Arg His Leu Pro Glu His Ala He 
115 120 125 

Val Gin Phe Val Lys Ala He . Cys Ser Pro Met Ala Leu Ala Ser 
130 135 140 

<210> 7 

<211> 1734 

<212> DNA 

15 <213> Mycobacterium tuberculosis 

<220> 

<221> CDS 
20 <222> .(1)..(1734) 

<400> 7 

atg aca aca ggg ggc etc gtc. gac gaa aac gac ggc gcc gca atg cgt 
25 48 

Met Thr Thr Gly Gly Leu Val Asp Glu Asn Asp Gly Ala Ala Met Arg 
1 5 10 15 • 

cca ctg cgt cac acg- etc tec caa eta cgc ctg cac gag ctg ctg gtc 
30 96 

Pro Leu Arg. His Thr Leu Ser Gin Leu Arg Leu His Glu Leu Leu Val 
20 25 . .30 

gag gtg cag gac egg gtc gag cag'atc gtc gag ggc egg gac cgc etc 
35 • 144 

Glu Val Gin Asp Arg Val Glu Gin He Val Glu Gly Arg Asp Arg Leu 
35 40 .45 

gat ggt ctg gtg gag gcc atg etc gtg gtc aca gcg ggc ctg gac ctg 
40 192 

Asp Gly Leu Val Glu Ala Met Leu Val Val Thr Ala Gly Leu Asp Leu 
• 50 55 . 60 

gag gca ace eta cgc get ate gtg cat tea gcg ace age ett gtc gat 
45 240 

Glu Ala Thr Leu Arg Ala lie Val His Ser Ala Thr Ser Leu Val Asp 
65. 70' 75 80 

gcg cgc tat ggc get atg gag gtg cac gac egg cag eat egg gta ttg 
50 288 

Ala Arg Tyr Gly Ala Met Glu Val His Asp Arg Gin His Arg Val Leu 

85 90 ■ . 95. • . 

cac ttt gtc tat gaa ggc ate gac gag gag ace gtt egg egg ate ggc 
55 336 

His Phe Val Tyr Glu Gly He Asp Glu Glu Thr Val Arg Arg He Gly 
100 105 110 

cac eta ccg aaa ggc eta ggc gtc ate ggg ctg etc ate gaa gat ecc 
, 60 384 . 

. His Leia Pro Lys Gly Leu Gly Val He Gly Leu Leu He Glu Asp Pro 
115 120 125 
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aaa ccg tta egg ctg gac gat gtt tct gcg cac ccg gcc tog att ggt 

Lys Pro Leu Arg Leu Asp Asp Val Ser Ala His Pro Ala Ser He Gly 
5 "5 140 

ttt ccg ccg tat cat ccg ccg .tg cgt acc ttc etc ggg gta ccg gtt' 

Phe Pro Pro Tyr His Pro Pro Met Arg Thr Phe Leu Gly Val Pro Val 
10 ^5° 155 . 

egg gtg ege gat gaa.teg tte .ggc act etg tac ctg act gac aag acc 
Arg Val Arg Asp Glu Ser Phe Gly Thr . Leu Tyr Leu Thr Asp Lys Thr 
15 "0 

aac ggg caa ccg ttc age gae gac gac gag gtt etg .gtc eag gcg ctg 
Asn Gly Gin Pro Phe Ser Asp Asp Asp Glu Val Leu Val- Gin Ala Leu 
20 190 

gcg gee gee gcg ggt ate gca gtc gcg aat gcc egg etc tac cag cag 
Ala Ala Ala Ala Gly He Ala Val Ala Asn-Ala Arg -Leu Tyr Gin Gin 
25 2°° 205 

|ct aag gcg egt eag teg tgg ate gag gee acc cgt' gae ate gee ace 

Ala Lys Ala Arg Gin Ser Trp He Glu Ala Thr Arg Asp He Ala Thr 
30 • 215 220 

gag .ttg ttg tec ggc ace gaa ecc gcg acg gtg ttc egg ett gtc gee 

Glu Leu Leu Ser Gly Thr Glu Pro Ala Thr Vai Phe Arg Leu Val Ala 
35 ■ 235, 240 

gcg gag geg etc aag etg acg gcg get gae get gee etg gta gcc gtt 

Ala Glu Ala Leu Lys Leu Thr Ala Ala Asp Ala Ala Leu Val Ala Val 
40 245 - 250 255 

. ccc gtc gac gag gae atg ect gee get gae gtg ggg gag etg etg gtg 

pro val Asp Glu Asp Met Pro Ala Ala Asp Val Gly Glu Leu Leu Val 
45 270 

att gaa- aca gte ggc age get gtg get tec att gtt ggg ega acg att 

lie Glu Thr Val Gly Ser Ala Val Ala Ser He Val Gly Arg Thr He 
50 285 

ocg gtg geg ggc gcg" gtg etg egg gag" gte tte gte aac ggc att ccg 

Pro Val Ala Gly Ala- Val Leu Arg Glu Val Phe Val Asn Gly He Pro 

55 300 

•cga egg gtc gac egg gtc gat ttg gaa ggc ctg gae gaa ctg gee gac 

Arg Arg Val Asp Arg Val Asp Leu Glu Gly Leu Asp Glu Leu Ala Asp 
60 315 

gea^ggt ccg geg etg etg ttg ccg etg egg gee aga ggt acc gta geg 

Ala Gly Pro Ala Leu Leu Leu Pro Leu Arg Ala Arg Gly Thr Val Ala 

330 
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ggt gtc gtt gtt gtg ctg agt caa ggc ggt cca ggg get ttc acc gac 
1056 

Gly Val Val Val Val Leu Ser Gin Gly Gly Pro Gly Ala Phe Thr Asp 
5 340 345 350 

gaa caa etc gag atg atg gee gcg ttc gee gac eag gee gcg ctg get 

1104 

Glu Gin Leu Glu Met Met Ala. Ala Phe Ala Asp Gin Ala Ala Leu Ala 
10 355 360 365 

tgg caa ttg gee act teg caa cgt egg atg cgc gaa etc gac gta ctg 
1152 

Trp Gin Leu Ala Thr Ser Gin Arg Arg Met Arg Glu Leu Asp - Val Leu 
15 370 375 380 

acc gac egg gat cgt ate gee cgt gac etc cat gac cat gtc ate cag 
1200 

Thr Asp Arg Asp Arg lie Ala Arg Asp Leu His Asp His Val lie Gin 

20 385 390 395 400 

egg etc ttc gcg att ggc ctg get ttg cag. ggt get gte.ecg cac gaa 
1248 

Arg Leu Phe Ala lie Gly Leu Ala Leu Gin Gly Ala Val Pro His Glu 
25 405 410 .415 

cgt aat ect gaa gtg cag caa ega etc teg gac gtg gta gac gat ctg 
1296 . * ' 

Arg Asn Pro Glu Val Gin Gin Arg Leu Ser Asp Val Val Asp Asp Leu 
30 ; .420 425 430 

caa gac. gtt ata cag gaa ate egg ace ace att tat. gac ctg cac gga 
1344 

Gin Asp Val lie Gin Glu lie Arg -Thr Thr He Tyr Asp Leu His Gly 

35 435 440 " • 445 

gca teg cag ggt ate act egg etc egg eag ega ate gat gcg gee gta 
1392 

Ala Ser Gin Gly He Thr Arg L6u Arg Gin Arg He Asp Ala Ala Val 
40 450 455 460 

gee caa ttt gee gac teg ggg ttg cgc acc age gtt caa ttc gtg ggt 
1440 

Ala Gin Phe Ala Asp Ser Gly Leu Arg Thr Ser Val Gin Phe Val Gly 

45 465 470 475 480 

cca ttg teg gtg gtc gac age gcg etc gee gat cag gcc gag gcg gtg 
1488 

Pro Leu Ser Val Val Asp Ser Ala Leu Ala Asp Gin Ala Glu Ala Val 
50 485 490 495 

gtt egg gaa gcg gtc age aac gcg .gtt cgc cat gcg aag gee age aeg 
1536 

Val Arg Glu -Ala Val Ser Asn Ala Val Arg His Ala Lys Ala Ser Thr 
55 500 505 510 

ttg acc gtc egg gtc aaa gtc gac gac gac ttg tgc ate gag gtg acc 
' 1584 

Leu Thr Val Arg Val Lys Val Asp Asp Asp Leu Cys He Glu Val Thr 

60 515 520 525 

gac aac ggc cgc ggg ctg ecc gac gag ttc acc gga age ggc tta acg 
1632 

Asp Asn. Gly Arg Gly Leu Pro Asp Glu Phe Thr Gly Ser Gly Leu Thr 
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530 535 540 

?680°^^ egg cag egg gca gag cag gco gge gge gaa tto ace cte .gcg 

5 Asn Leu Arg Gin Arg Ala Glu Gin Ala Gly Gly Glu Phe Thr Leu Ala 

550 555 . • ■ 560 

1728^^ "==5 ggc gcg age gga aca gtg ctg ega tgg tea gca ccg ttg 

10 Ser Val Pro Gly Ala Ser Gly Thr Val Leu Arg Trp Ser Ala Pro Leu" 

565 570 575 

teg cag 
1734 
15 Ser Gin 



<210> 8 

20 <211> 578 

<212> PRT 

<213> Mycobacterium tuberculosis 



25 <400> 8 

Thr Gly Gly Leu Val Asp Glu 

10 . " . 15 



Met Thr Thr Gly Gly Leu Val Asp Glu Asn Asp Gly Ala Ala Met Arg 



30 



Pro Leu Arg His Thr Leu Ser Gin Leu Arg Leu His Glu Leu Leu Val 
20 . 25 . . 30 . 



35 Glu Val Gin Asp Arg Val Glu Gin lie Val Glu Gly Arg Asp Arg Leu 
35 40 . 45 

/in f ^^"^ '^•'•^ 1*®^ Val Thr Ala Gly Leu Asp Leu 

50 55 gQ 



45 



50 



Glu Ala Thr Leu Arg Ala He Val His Ser Ala Thr Ser Leu Val Asp 
" . . 70 -75 80 

Ala Arg Tyr Gly Ala Met Glu Val His Asp Arg Gin His Arg Val Leu 
85 90 95 

His Phe Val Tyr Glu Gly He Asp Glu Glu Thr Val Arg Arg He Glv 
100 105 110 



55 His Leu Pro Lys Gly Leu Gly Val He Gly Leu Leu He Glu Asp Pro 
115 120 125 

Ser Ala His Pro Ala Ser He Gly 
oO 130 135 140 ■ ^ 

Phe Pro Pro Tyr His Pro Pro Met Arg Thr Phe Leu Gly Val Pro Val 

150 155 3^50 
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Arg Val Arg Asp Glu Ser Phe Gly Thr Leu Tyr Leu Thr Asp Lys Thr 
165 170 175. 

5 ■ . 

Asn Qly Gin Pro Phe Ser Asp Asp Asp Glu Val Leu Val Gin Ala Leu 
180 185 190 

10 

Ala Ala Ala Ala Gly He Ala Val Ala Asn Ala Arg Leu Tyr Gin Gin 
195 200 205 

15 Ala Lys Ala Arg Gin Ser Trp He Glu Ala Thr Arg Asp He Ala Thr 
210 215 . 220 

Glu Leu Leu Ser Gly Thr Glu Pro Ala Thr Val Phe Arg Leu Val Ala 
20 225 230 235 240 

Ala Glu Ala Leu Lys Leu Thr Ala Ala Asp Ala Ala Leu Val Ala Val 
245 \ 250 255 

25 

Pro Val Asp Glu Asp Met Pro Ala Ala Asp Val Gly Glu Leu Leu Val 
260 265 270 

30 ' 

He Glu Thr Val Gly Ser Ala Val Ala Ser He Val- Gly Arg Thr He. 
275 280 285 

35 Pro Val Ala Gly Ala Val Leu Arg Glu Val Phe Val Asn Gly He Pro 
290 295 300 " 

Arg Arg Va:i Asp Arg Val Asp Leu Glu Gly Leu Asp Glu Leu Ala Asp 
40 305 310 315 320 

Ala Gly Pro Ala Leu Leu Leu Pro Leu Arg Ala Arg Gly Thr Val Ala 
325 330 335 

45 

Gly Val Val Val Val Leu Ser Gin Gly Gly Pro Gly. Ala Phe Thr Asp 
340 345 350 

50 

Glu Gin Leu Glu Met Met Ala Ala Phe Ala Asp Gin Ala Ala Leu Ala 
355 360 365 

55 Trp Gin Leu Ala Thr Ser Gin Arg Arg Met Arg Glu Leu Asp Val Leu 
370 375 380 

Thr Asp Arg Asp Arg He Ala Arg Asp Leu His Asp His Val He Gin 
60 385 390 395 400 



Arg Leu Phe Ala He Gly Leu Ala Leu Gin Gly Ala Val Pro His Glu 
405 410 415 
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A±:g Asn Pro Glu Val Gin Gin Arg Leu Ser Asp Val Val Asp Asp Leu 
420 425 430 



10 



Gin Asp Val lie Gin Glu He Arg Thr Thr He Tyr Asp Leu His Gly 
435 440 . 445 

Ala Ser Gin Gly lie Thr Arg Leu' Arg Gin Arg He Asp Ala Ala Val 
450 455 460 



15 Ala Gin Phe Ala Asp Ser Gly Leu Arg Thr Ser Val Gin Phe Val Gly 
465 470 475 480 

Pro Leu Ser Val Val Asp Ser Ala Leu Ala Asp Gin Ala Glu Ala Val 
20 485 490 495 

Val Arg Glu Ala V$l Ser Asn Ala Val Arg His Ala Lys Ala 5er Thr 
.500 505 510 

25 - 

. Leu Thr Val Arg Val Lys Val Asp Asp Asp Leu Cys lie Glu Val Thr 
515 * 520 525 

30 

Asp Asn Gly ^ Arg Gly Leu Pro Asp Glu Phe Thr iGly Ser Gly Leu Thr 
. 530 - 535 540 T " - 

• .... ■ - - ■ ' ^ - ^ - J 

35 Asn Leu Arg Gin Arg Ala Glu Gin Ala Gly Gly Glu Phe Thr Leu Ala 
- 545 550 555 560 

Ser Val Pro Gly Ala Ser Giy Thr Val Leu Arg Trp Ser Ala Pro Leu 
40 565 570 . 575 

Ser Gin 

45 

<210> 8 
<211> 578 
<212> PRT 

<213> Mycobacterium tuberculosis 

50 

<400> 8 

Met Thr Thr Gly Gly Leu Val Asp Glu Asn Asp Gly Ala Ala Met Arg. 
55 1 5 10 15 



Pro Leu Arg His Thr Leu Ser Gin Leu Arg Leu His Glu Leu Leu Val 

60 



20 - 25 |V- - . 30 



Glu Val Gin Asp Arg Val Glu Gin He val Glu Gly Arg Asp Arg Leu 
35 40 45 
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Asp Gly Leu Val Glu Ala Met Leu Val Val Thr Ala Gly Leu Asp Leu 
SO . 55 60 

5 • ■ 

Glu Ala Thr Leu Arg Ala lie Val His Ser Ala Thr Ser Leu Val Asp 
65 70 - 75 80 

10 "Ala Arg Tyr Gly Ala Met. Glu Val His Asp Arg Gin His Arg Val Leu 

85 90 95 

His Phe Val Tyr Glu Gly lie Asp Glu Glu Thr Val Arg Arg He Gly 
15 100 105 110 



20 



25 



40 



45 



His Leu Pro Lys Gly Leu Gly Val He Gly Leu Leu He Glu Asp Pro 
115 120 125 - 

Lys Pro Leu Arg Leu Asp Asp Val Ser Ala His Pro Ala Ser He Gly 
130 135 140 

Phe Pro Pro Tyr His Pro Pro Met Arg Thr Phe Leu Gly Val Pro Val 
145 150 155 160 



30 Arg Val Arg Asp Glu Ser Phe Gly Thr Leu Tyr Leu Thr Asp Lys Thr 

165 170 . 175 ■ • 

Asn Gly Gin Pro Phe Ser Asp Asp Asp Glu Val Leu Val Gin Ala Leu 
35 180 185 190 



Ala Ala Ala Ala Gly He Ala Val Ala Asn Ala Arg Leu Tyr Gin . Gin 
195 200 205 

Ala Lys Ala Arg Gin Ser Trp He Glu Ala Thr Arg Asp He Ala Thr 
210 . 215 220 

Glu Leu Leu Ser Gly Thr Glu Pro Ala Thr Val Phe Arg Leu Val Ala 
225 230 235 240 



50 Ala Glu Ala Leu Lys Leu Thr Ala Ala Asp Ala Ala Leu Val Ala Val 

245 250 255 



Pro Val Asp Glu Asp Met Pro Ala Ala Asp Val Gly Glu Leu Leu Val 
55 260 265 270' 



60 



He Glu Thr Val Gly Ser Ala Val Ala Ser He Val Gly Arg Thr He 
275 280 • 285 



Pro 



Val Ala Gly Ala 
290 



Val Leu Arg Glu Val Phe Val Asn Gly He Pro 
295 300 
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Arg Arg Val Asp Arg Val Asp Leu Glu Gly Leu Asp Glu Leu Ala Asp 
305 310 315 320 

5 

Ala Gly Pro Ala Leu Leu Leu Pro Leu Arg Ala Arg Gly Thr Val Ala 
325 330 335 

10 Gly Val Val Val Val Leu Ser Gin Gly Gly Pro Gly Ala Phe Thr. Asp 
340 345 350 

Glu Gin Leu <31u Met Met Ala Ala Phe Ala Asp Gin Ala Ala Leu Ala 
15 355 360 365 

Trp Gin Leu Ala Thr Ser Gin Arg Arg Met Arg Glu Leu Asp Val Leu 
370 . 375 380 

20 { } 

i f 

Thr Asp Arg Asp Arg lie Ala Arg Asp Leu His Asp Kis Val- lie Gin 
385 390 .395 400 



25 



Arg Leii Phe Ala lie Gly Leu Ala Leu Gin Gly Ala Val Pro His Glu 
405 410 415 



30 *Arg Asn Pro Glu Val Gin Gin Arg Leu Ser Asp Val Val Asp Asp Leu 
420 425 430 

Gin Asp Val He Gin Glu He Arg Thr Thr He Tyr Asp Leu His Gly 
35 435 - . 440 . . 445 



40 



Ala Ser Gin Gly He Thr Arg Leu Arg Gin Arg He Asp Ala Ala Val 
450 455 460 



45 



Ala Gin Phe Ala Asp Ser Gly Leu Arg Thr Ser Val Gin Phe Val Gly 
465 . 470 475 480 

Pro Leu Ser Val Val Asp Ser Ala Leu Ala Asp Gin Ala Glu Ala Val 
485 490 495 



50 Val Arg Glu Ala Val Ser Asn Ala Val Arg His Ala Lys Ala Ser Thr 
.500 505 510 

Leu Thr Val Arg. Val Lys Val Asp Asp Asp Leu Cys He Glu Val Thr 
55 515 520 525 



60 



Asp Asn Gly Arg Gly Leu Pro Asp Glu Phe Thr Gly Ser Gly Leu . Thr 
. 530 535 540 



Asn Leu Arg Gin Arg Ala Glu Gin Ala Gly Gly Glu Phe Thr Leu Ala 
545 550 • . 555 560 
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Ser Val Pro Gly Ala Ser Gly Thr Val Leu Arg Trp Ser Ala Pro Leu 
565 570 575 

Ser Gin 



10 <210> 9 

<211> 141 ' 

<212> PRT 

<213>. Streptomyces coeli color 

15 

<400> 9 

Met Thr Arg Arg lie Arg Asp Val Met Ser Pro Ala Ala Val Ala Val 
1 ' 5 10 15 

Glu Pro Met Thr Thr Val Ala Arg Ala Ala Arg Leu Met Arg Glu Glu 
20 . ' * 25 30 

Asp Val Gly Asp Val Leu Val Thr Tyr Asp Cys Asp Leu Phe Gly Val 
25 35 40 45 

Leu Thr Asp Arg Asp lie Val Leu Arg Gly Val Ala Asp Gly Arg Asp 
50 55 60 . 

30" Ser Glu Ala Thr Thr Val Gly Ala Val Cys Thr Pro Pro Pro Val Val 

65 . 70 75 80 



20 



35 



45 



50 



Thr Leu Glu Pro Asp Asp Thr Thr -Asp Arg Ala Ala Glu Leu Met Ala 
85 . 90 .95* 

Arg His Ala Val Arg Arg Leu Pro Val Val Glu His Gly Gly Val Pro 

100 105 110 



Val Gly Val Val Thr Leu Gly Asp Leu Ala Ala Thr Asp Asp Pro His 

40 115 120 125 

Ser Ala Leu Ala Asp lie Ser Arg Ala Ala Pro Gly His 

130 . 135 ■ .. 140 



<:210> 10 • 

<211> 138 

<212> PRT 

<213> Pseudomonas aeruginosa 



<400> 10 



Met Lys lie Ser Asp lie Met Thr Arg Asn Val Gin Val Ala Asp Pro 
55 1 . 5 10 .. 15 . 

Gin Gin Ser lie Arg Glu Ala Ala Ala Thr Met Ala Arg lie Asp Ser 
20 25 30 

60 Gly Ala Leu Leu Val Gly Glu Gly Asp Arg Leu Val Gly Met lie Thr 
35 40 45 

Asp Arg Asp lie Ala lie Arg Ala Val Ala Gly Gly Leu Ser Gly Asp 
50 55 60 
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Thr Pro Leu Gly Arg lie Met Ser Gly Asp lie His Tyr Cys Phe Glu 
65 70 75 80 

5 Asp Glu Asp Val Gin His Val Ala Arg Asn Met Ala Asp lie Gin Met 

85 90 95 

Arg Arg Leu Pro Val Leu Asn Arg Glu Lys Arg Leu Val Gly Val Val 
100- 105 110 

10 

Ser Leu Gly Asn He Ala Ser Cys Arg Asp Gin Ala Ser Ser Ala Thr 
115 120 • 125 

Val Leu Gin Gly Val Ala Gin Ala His Tyr 
15 130 135 

<210> 11 ■ 

<211> 140 

20 <212> PRT 

<213> Bacillus subtilis 

<400> 11 . 

25 . 

Met Ser Ser Val Lys Asp Thr Met Thr Thr Gin Val Ala Thr Val Ser 
' 1 ■ 5 '10 15 

Pro Asn Gin Thr He Gin Glu Ala Ala Ser Leu Met Lys Gin His Asn 
30 * .20 25 30 

Val Gly Ala lie Pro Val Val Glu Gin Gly Val Leu Lys Gly Met Leu 
35 40 .. 45 

35 Thr Asp Arg Asp He Ala Leu Arg Thr Thr Ala Gin Gly Arg Asp Gly 

50 55 ■ 60 . 

Gin Thr Pro Val Ser Glu Val Met Ser Thr Glu Leu Val Ser Gly Asn 
65 70 75 80 

40 

Pro Asn Met Ser Leu Glu Asp Ala Ser Gin Leu Met Ala -Gin His Gin 
85 90 95 

He Arg Arg Leu Pro He Val Asp Gin Asn Asn Leu Val- Gly He Val 
45 100 105 110 

Ala Leu Gly Asp Leu Ala Val Asn Gin Met Ser Asn Glu Ser Ala Gly 
115 120 125 

50 Ser Ala Leu Thr Asn He Ser His Gin Asn He His 
130 135 140 



55 



